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Description 

This application is a continuation-in-part of Looker and Hoffman. U.S. Ser. No. 07/374,161, DI-ALPHA AND DI-BETA 
GLOBIN LIKE POLYPEPTIDES AND USES THEREFOR, filed June 30, 1989; Stetler and Wagenbach. U S, Ser. No. 
07/379.1 16, PRODUCTION OF HUMAN HEMOGLOBIN BY TRANSFORMED YEAST CELLS, filed July 13. 1989; and 
Hoffman, Looker, Rosendahl and Stetler. U.S. Ser. No 07/349,623. POLYCISTRONIC CO-EXPRESSION OF THE 
ALPHA- AND BETAGLOBINS AND IN VIVO ASSEMBLY OF BIOLOGICALLY ACTIVE, TETRAMERIC HEMOGLOBIN, 
filed May 10, 1989; all owned by Somatogenetics International, Inc. 

Cross-Referenceto Related Applications 

Hoffman and Nagai. U.S. Ser. No. 07/1 94,338. filed May 10, 1988, presently owned by Somatogenetics International, 
Inc., relates to the use of low oxygen affinity mutant hemoglobins as blood substitutes, and to the expression of alpha 
and beta globin in nonerythroid cells. 

Field of the Invention 

The present invention relates to the intracellular assembly of a human hemoglobin-like protein in biologically func- 
tional, substantially soluble form through co-expression of alpha- and beta-globin-like polypeptides in bacterial or yeast 
celts. 

It further relates to the genetic fusing of the two alpha subunits of hemoglobin to form a novel polypeptide, di-alpha 
globin. which may be considered a partially assembled intermediate leading to a hemoglobin-like protein, and the use 
of this compound in the production of synthetic hemoglobins having an increased intravascular half-life as corrpared to 
stroma-free hemoglobins. It also relates to the analogous polypeptide di-beta globin. 

Information Disclosure Statement 

It is not always practical to transfuse a patient with donated blood. In these situations, use of a red blood cell substitute 
is desirable. The product must effectively transport O2. just as do red blood cells. ("Plasma expanders", such as dextran 
and albumin, do not transport oxygen.) The two types of substitutes that have been studied most extensively are hemo- 
globin solutions and fluorocarbon emulsions. 

Structure and Function of Hemoglobin 

Hemoglobin (Hgb) is the oxygen-carrying component of blood. Hemoglobin circulates through the bloodstream 
inside small enucleate cells called erythrocytes (red blood cells). Hemoglobin is a protein constructed from four asso- 
ciated polypeptide chains, and bearing prosthetic groups known as hemes. The erythrocyte helps maintain hemoglobin 
in its reduced, functional form The heme iron atom is susceptible to oxidation, but may be reduced again by one of two 
enzyme systems within the erythrocyte, the cytochrome b^ and glutathione reduction systems. 

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2. because it comprises two 
alpha and two beta chains). The alpha chain consists of 141 amino acids. The iron atom of the heme (ferroprotoporphyrin 
IX) group is bound covalently to the imidazole of His 87 (the "proximal histidine "). The beta chain is 146 residues long 
and heme is bound to It at His 92. Apohemoglobin is the heme-free analogue of hemoglobin; it exists predominantly as 
the op-globlndimer. 

Separated, heme-free, alpha-and beta-globins have been prepared from the heme-containing alpha and beta sub- 
units of hemoglobin. The separated heme-free globin chains are folded very differently, even though the heme-containing 
subunits are highly similar in secondary structure and basic folding features. This shows that the binding of the prosthetic 
heme group to globin subunits has quite different effects on alpha and beta globin. Yip, et al., J. Biol. Chem., 247: 7237- 
44(1972). 

Native human hemoglobin has been fully reconstituted from separated heme-free alpha and beta globin and from 
hemin. Preferably, heme is first added to the alpha-globin subunit. The heme-bound alpha globin is then conplexed to 
the heme-free beta subunit. Finally, heme is added to the half-filled globin dimer. and tetrameric hemoglobin is obtained. 
Yip, et al., PNAS (USA), 74: 64-68 (1977). 

In cell-free systems prepared from unfractionated rabbit reticulocyte hemolysates. globin is actively synthesized for 
approximately five minutes, and then protein synthesis abruptly ceases. Prior addition of hemin prevents or delays the 
cessation of synthetic activity, as a result of the effect of hemin on an inhibitory protein known as "hemin-regulated 
inhibitor" (HRI). Hemin deficiency has a more severe effect on alpha chain synthesis than on beta chain synthesis as 
alpha globin mRNA is less efficient than beta-globin mRNA in initiating polypeptide chain synthesis. It has been spec- 
ulated that alpha chains are released from their site of synthesis only in the presence of free beta chains, which imme- 
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diately complex the released alpha chains to form ap-globin dimers. These then combine with heme to form tetrameric 
hemoglobin. Winterhalter and Huehns, J. Biol. Chem., 239: 3699 {1 964). It is certainly known that the addition of heme 
to ap-globin dimers (apohemoglobin) leads to the rapid formation of hemoglobin. 

The human alpha and beta gtobin genes reside on chromosomes 16 and 11, respectively. Bunn and Forget, supra 
5 at 1 72 Both genes have been cloned and sequenced. Liebhaber, et a!.. PNAS 77: 7054-58 (1980) (alpha-globin genomic 
DNA); Marotta, et al.. J. Biol. Chem., 252: 5040-53 (1977) (beta globin cDNA); Lawn, et al.. Cell. 21:647 (1980) (beta 
globin genomic DNA). 

Hemoglobin exhibits cooperative binding of oxygen by the four subunits of the hemoglobin molecule (two alpha- 
globins and two beta-globins in the case of Hgb A), and this cooperatlvity greatly facilitates efficient oxygen transport. 
10 Gooperativity, achieved by the so-called heme-heme interaction, allowshemoglobinto vary its affinity for oxygen. Hemo- 
globin reversibly binds up to four moles of oxygen per mole of Hgb. 

Oxygen-carrying compounds are frequently compared by means of a device known as an oxygen dissociation curve. 
This curve is obtained when, for a given oxygen carrier, oxygen saturation or content is graphed against the partial 
pressure of oxygen. For Hgb, the percentage of saturation increases with partial pressure according to a sigmoid rela- 
15 tionship. The P50 is the partial pressure at which the oxygen-carrying solution is half saturated with oxygen. It is thus a 
measure of oxygen-binding affinity; the higher the P50. the more loosely the oxygen is held. 

When the oxygen dissociation curve of an oxygen-carrying solution is such that the P50 is less than that for whole 
blood, it is said to be "left-shifted." 

The oxygen affinity of hemoglobin is lowered by the presence of 2,3-diphosphoglycerate (2,3-DPG), chloride ions 
20 and hydrogen ions. Respiring tissue releases carbon dioxide into the blood and lowers its pH (i.e. increases the hydrogen 
ion concentration), thereby causing oxygen to dissociate from hemoglobin and allowing it to diffuse into individual cells. 

The ability of hemoglobin to alter its oxygen affinity, increasing the efficiency of oxygen transport around the body, 
is dependent on the presence of the metabolite 2,3-DPG. Inside the erythrocyte 2,3-DPG is present at a concentration 
nearly as great as that of hemoglobin itself. In the absence of 2,3-DPG "conventionar hemoglobin tiinds oxygen very 
25 tightly and would release little oxygen to respiring tissue. 

Aging erythrocytes release small amounts of free hemoglobin into the tHood plasma where it is rapidly bound by 
the scavenging protein haptoglobin. The hemogiobin-haptoglobin complex is removed from the blood and degraded by 
the spleen and liver. 

30 Blood Substitutes, Generally 

It is clear from the above considerations that free native hemoglobin A, injected directly into the bloodstream, would 
not support efficient oxygen transport about the body. The essential allosteric regulator 2,3-DPG is not present in suffi- 
cient concentration in the plasma to allow hemoglobin to release much oxygen at venous oxygen tension. 

35 Nonetheless, solutions of conventional hemoglobin have been used as RBG substitutes. The classic method of 
preparing hemoglobin solutions employs outdated blood. The red cells are lysed and cellular debris is removed, leaving 
what is hopefully "stromal-free hemoglobin" (SFH). 

Several basic problems have been observed with this approach. The solution must be freed of any toxic components 
of the red cell membrane without resorting to cumbersome and tedious procedures which would discourage targe-scale 

40 production. DeVenuto, "Appraisal of Hemoglobin Solution as a Blood Substitute", Surgery. Gynecology and Obstetrics . 
149: 417-436 (1979). 

Second, as expected, such solutions are "left-shifted" (lower P50) as compared to whole blood. Gould, et al., "The 
Development of Polymerized Pyrldoxylated Hemoglobin Solution as a Red Cell Substitute", Ann. Emerq. Med. 15: MIS- 
HI 9 (Dec. 3, 1986). 

45 Third, SFH has a half-life in the circulatory system of only about 2-4 hours. This is because oxy Hgb partially dis- 
sociates into a dimer (ofi) that is small enough to be filtered by the kidney 

Finally, SFH has a high colloid osmotic pressure (COD). Thus, administration of SFH in a dose that would have the 
same oxygen-carrying capacity as a unit of packed red blood cells is inadvisable, since the high osmotic pressure (60mm 
Hg) would cause a massive influx of water from the cells into the bloodstream, thus dehydrating the patient's tissues. 

50 This consideration limits the dose of SFH to that which provide a final concentration of about 6-8 gm Hgb/dl. 

In an effort to restore the desired P50. researchers added 2,3-DPQ to the hemoglobin solution. Unfortunately. 2,3- 
DPG was rapidly eliminated from the circulation. Scientists then turned to other organic phosphates, particularly pyridoxal 
phosphate Like 2,3-DPG, these compounds stabilized the "T state" of the Hgb by forming a salt bhdge between the N- 
termini of the two beta chains. The pyridoxylated hemoglobin had a P50 of 20-22 ton-, as conpared to 10 ton- for SFH 

55 and 28 torr for whole blood. While this is an improvement over SFH, the pyridoxylated Hgb remains "high affinity" relative 
to whole blood. 
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Chemical Crosslinking of Hemoglobin Subunits 

The properties of hemoglobin have been altered by specifically chemically crosslinking the alpha chains between 
the Lys99 of alphal and the Lys99 of alpha2. Walder. U.S. 4,600,531 and 4,598.064; Snyder, et al.. PNAS (USA) 84: 
7280-84 (1 987); Chaterjee. et al.. J. Biol. Chem., 261 : 9927-37 (1 986). The P50 was 29 mm Hg, and renal excretion was 
abrogated by the crosslinking, but the plasma half-life was increased just 2-3 fold. 

This chemical crosslinking was accomplished by reacting bis(3,5-dibromosalicyl) fumarate with deoxyhemoglobin 
A in the presence of inositol hexaphosphate. This reaction has a low yield (10-20%). Moreover, purification is required 
to eliminate derivatives modified at other sites (there are 42 other lysine residues and the amino terminal amino groups 
of the four chains at which competing reactions could occur). 

A further problem with the use of a "diaspirin" crosslinking agent is that it can participate in a side reaction yielding 
a carcinogenic halophenol. 

In the hemoglobin analogue of the present invention, the N-terminal valine and C-terminal arginine of the alpha 
globins are connected by means of an amino acid or peptide linker, without resort of special coupling agents. 

The beta chains have also been chemically crosslinked. Kavanaugh, et al., Biochemistry, 27. 1804-8(1988). 
Kavanaugh notes that the beta N-termini are 16 A apart in the T state and 20 A apart in the R state. Not surprisingly, 
the introduction of a DIDS bridge between the N-termini of T state hemoglobin hindered the shift to the R state, thereby 
decreasing the O2 affinity of the molecule. While the Kavanaugh analogue has desirable oxygen binding and renal 
clearance characteristics, it too is obtained in low yield. 

D. Gene Expression, Generally 

Gene expression embraces the transcription of DNA into messenger RNA. and the translatbn of messenger RNA 
into protein. The process of transcription begins when an enzyme, DNA-directed RNA polymerase, binds to DNA. The 
binding site for this enzyme is often called the "promoter. " and the binding of the enzyme to the promoter may be controlled 
by various repressors or inducers of transcription. The RNA polymerase slides along the DNA molecule, manufacturing 
a messenger RNA transcript. When it encounters a second regulatory element, the 'lerminator," the enzyme falls off. 
and the mRNA transcript is formed. 

Messenger RNA is used by the ribosomes, the protein factories of the cell, as a template for the construction of the 
corresponding protein . The ritx}somal binding site comprises the so-called Shine Delgarno (SD) sequence and a properly 
spaced initiation (start) codon. Beginning at a special RNA triplet known as the initiation codon, transfer RNAs bind to 
corresponding codons of the messenger. Each transfer RNA is two-handed; it binds to the messenger codon by means 
of a complementary anti-codon, while holding the corresponding amino acid in position to be linked into the growing 
polypeptide chain. The chain falls off when the ribosome encounters one of three special triplets known as "stop" codons. 
That part of the original gene which corresponds to the messenger sequence from the initiator codon to the last codon 
before the stop codon is known as the coding sequence. There is also a 5'-flanking sequence, which begins with the 
promoter, and a 3*-flanking sequence which ends with the terminator. 

Polycistronic Expression 

It is possible for a single messenger RNA transcript to have one promoter, but two or more pairs of start and stop 
codons that define distinctly translatable sequences. Each such sequence is known as a "cistron." and the polypeptide 
corresponding to the cistrons are thus co-expressed under the control of the single promoter. 

The majority of bacterial operons are polycistronic. that is. several different genes are transcribed as a single mes- 
sage from their operons. Examples include the lactose operon with three linked genes (lacZ, lac Y and lacA) and the 
tryptophan operon with five associated genes (IrgE. IebD. tUdP. imB, and trpA). In these operons. the synthesis of mes- 
senger RNA is initiated at the promoter and, within the transcript, coding regions are separated by intercistronic regions 
of various lengths. {An operon is a cluster of genes that is controlled as a single transcriptional genetic unit). Translational 
efficiency varies from cistron to cistron. Kastelein, et al.. Gene, 23; 245-54 (1983). 

When intercistronic regions are longer than the span of the ribosome (about 35 bases), dissociation at the stop 
codon of one cistron is followed by independent initiation at the next cistron. With shorter intercistronic regions, or with 
overlapping cistrons, the 30S subunit of a terminating ribosome may fail to dissociate from the polycistronic mRNA, 
being instantly attracted to the next translational initiation site. Lewin, Gene Expression . 143-148. 



Unlike bacterial mRNAs, eukaroyotic mRNAs are generally monocistronic in nature. Lewin, Gene Expression . 157. 

Synthetic polycistronic operons have been constructed and expressed in both prokaryotes and eukaryotes. 

Lee, et al.. Nucleic Acids Res.. 12; 6797 (1984) describe a special case of a synthetic polycistronic operon In which 
all of the cistrons express the same polypeptide. This homopolycistronic structure was constructed to maximize the gene 
dosage effect. 



EP 0 700 997 A1 



Schoner, et al.. PNAS, 83: 8506-10 (1986) translated a synthetic two-cistron mRNA in £ otIi. The first cistron was 
a short, arbitrary AU-rich sequence, while the second cistron was a mammalian gene (bGH). It was found that "read 
through" translation occurred if the stop codon of the first cistron followed the SD element of the second cistron and lay 
close to the start codon of the second cistron. Schoner's purpose was to overcome his failure to express Met-bGH with 
its native codons at high levels, possibly as a result of inhibition of translation by local secondary structures. The first 
cistron was engineered to favor ribosome binding (by placing the SD sequence and the AUG initiation codon in an AU- 
nch region free of local secondary structure). See also Schoner, et al., Meth. Enzymol.. 153: 401-416 (1987), which 
reveals that bGH overproduction by this technique was associated with the formation of protein granules. 

Saito, et al . J. Biochem., 101 : 1281-88 (1987) expressed a synthetic somatomedin C gene in coH using a two 
cistron system. They theorized that the instability of somatomedin C, a basic polypeptide, might be overcome by com- 
plexing it with an acidic polypeptide. Thus, they constructed a two-cistron system which could express both polypeptides. 
The termination codon of the first cistron overlapped the initiation codon of the second cistron. The transformants accu- 
mulated Somatomedin C at high levels. However, the somatomedin C was recovered in the form of insoluble pellets (see 
page 1282). 

The ribosomes of mammalian cells are likewise capable of reinitiating translation at an initiation codon downstream 
from a termination codon. Thus. Boel, et al., FEBS Lett . 21 9 1 81 (1987) expressed a dicistronic transcription unit in 
mammalian (CHO) cells. This unit directed synthesis of both the precursor of human pancreatic polypeptide and of a 
selectable genetic marker (mouse DHFR). 

CODON. WO88/05486 describes the production of dicistronic mRNA which encodes both a protein of interest (e.g. , 
tissue plasminogen activator) and a selectable phenotype (e.g.. neomydnresistance). The common promoter was, In 
each of the examples a derivative of the Harvey murine sarcoma virus, and the dicistronic mRNA was translated in 
suitable eukaryotic cells. 

GENENTECH. EP App1 1 1 7,058 discloses the expression in vertebrate host cells of a dicistronic expression vector 
wherein one cistron codes for the desired protein (e.g., HbsAg) and a second codes for a second protein (e.g.. DHFR) 
whose synthesis is subject to environmental control (e.g.. with methotrexate). 

F. Fused Genes and Proteins, Generally 

Genes may be fused together by removing the stop codon of the first gene, and joining It in phase to the second 
gene. Parts of genes may also be fused, and spacer DNAs which maintain phase may be interposed between the fused 
sequences. The product of a fused gene is a single polypeptide, not a plurality of polypeptides as is expressed by a 
polycistronic operon. Different genes have been fused together for a variety of purposes. Thus, Gilbert. U.S. 4,338.397 
inserted a rat preproinsulin gene behind a fragment of the E coli penicillinase gene. His purpose was to direct E coj] 
transformants to secrete the expression product of the fused gene. Fused genes have also been prepared so that a 
non-antigenic polypeptide may be expressed already conjugated to an immunogenic carrier protein. The present inven- 
tion, however, contemplates the joining of two copies of the same gene. 

The use of linker DNA sequences to join two different DNA sequences is known. These linkers are used to provide 
restriction sites for DNA cleavage, or to encode peptides having a unique character that facilitates purification of the 
encoded fusion protein or a fragment thereof. See. e.g., Rutter, U.S 4.769.326. 

The lectin of Pisum sativum seeds is synthesized as a single 275-amino acid preproprotein consisting of a signal 
sequence followed first by the beta chain and then by the alpha chain. The signal sequence is removed in the endoplasmic 
reticulum, and In the protein bodies the resulting "prolectin" is cleaved Into a 187-AA beta chain and a 58-AA alpha 
chain. (Further processing results In truncation at the carboxyl termini). While the pea seed isolate is thus a heterodimer, 
it was discovered that the uncleaved naturally-occurring "prolectin" also binds cartx)hydrates, and that this "prolectin" 
could be expressed in E soli and recovered in functional form. Stubbs, et al., J. Biol. Chem.. 26: 6141-44 (1986). 

Toth, U.S. 4,774.180 teaches the expression of polyprotein. This polyprotein was ma6e from a fused DNA sequence 
encoding both a first polypeptide which catalyzes the reaction of glycine with ATP to form glycyl-adenylate and a second 
polypeptide which reacts glycyl adenylate with tRNA^*-^ to obtain the glycine-charged tRNA. These two polypeptides 
are the alpha and beta subunits of glycine tRNA synthetase which has an a2p2 quaternary structure. The two subunits, 
in the E eoli genome, are encoded by a single dicistronic gene. Toth linked the coding region of the alpha chain to the 
coding region of the beta chain by means of a linker encoding six amino ackte. See also Toth and Schimmel, J. Biol. 
Chem., 261: 6643-46 (May 1986). 

Ladner, U.S. 4,704,692 describes an expert system for finding linkers which may be used to convert two naturally 
aggregated but chemically separated polypeptide chains into a single polypeptkle chain with a similar conformation after 
folding. This system relies on a database containing amino acid sequences for which 3-D structures are known. The 
database is examined for candidate amino acid sequences with a span similar in length to the Interchain gap to be 
bridged. The direction and orientation of the candidate peptides in then checked. The algorithm assumes that these 
peptides will maintain the same length and orientation regardless of the flanking sequences. 
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Ladner, WO88/06601 presents a hypothetical approach to the preparation of "pseudodimeric" analogues of dimeric 
repressor proteins. In essence, an amino acid linker is introduced to convert the dimeric molecule into a single chain. 
According to Ladner. this linker may be designed directly by the method of the '692 patent; alternatively, the linker- 
encoding DNA is a random oligonucleotide and in vivo selection is used to find a pseudodimer whose linker permits the 

5 molecule to fold correctly and bind sequence-specifically to DNA. 

Hallewell, et a!.. J. Biol. Chem.. 264; 5260-68 (1989) prepared an analogue of CuZn superoxide dismutase. Each 
dismutase molecule is a dimer of two identical subunits; a copper ion and a zinc ion are liganded to the subunit. The 
dimer interaction in CuZn superoxide dismutase is so strong that the subunits have not been separated without inacti- 
vating the enzyme. The enzyme has considerable conformational similarity to immunoglobulins; Hallewell. et al., joined 

10 two human superoxide dismutase genes, either directly or with DNA encoding a 19-residue human Immunologlobulin 
IgAI hinge region and expressed the fused genes in a transformed host. In attempting to express the directly joined 
genes, recombination occurred to eliminate one of the tandem genes in come plasmid molecules. Hallewell, et al., 
postulated that the direct connection distorted the dimer, causing the exposure of hydrophobic areas which then had a 
toxic effect. This would have provided selection pressure favoring gene deletion. No recombination was detected with 

75 the IgAI linker construction. 

Unfortunately it cannot be assumed that a pseudodimeric fusion protein containing a peptide linker will fold properly 
so to be a functional equivalent of its parental heterodimer. 

G. Expression of Soluble Proteins 

20 

Efforts to produce heterologous proteins in transformed cells sometimes result in the precipitation of some or all of 
the protein as insoluble inclusion bodies, also known as refractile bodies. See, e.g.. Paul, et al.. Eur. J. Cell Biol., 31 :1 71 - 
174 (1983) (human prolnsulin/E coii trpE fusion protein); Denefle, et al.. Gene. 56:61-70 (1987) (angogenin); Langley, 
et al.. Eur. J. Biochem., 163:313-321 (1987) (bovine growth hormone); Petrov, et al., Biology of the Cell. 61 :1-4 (1987) 

25 (calcitonin); Richardson, et al.. Biochim. Biophys. Acta, 950:385-94 (1988) (ricin B chain); Davis, et al.. Biochemistry, 
26:1322-26 (1987) (tumor neaosis factor); Lee, et al., Biochim. Biophys. Res. Commun., 151 :598-607 (1988) (gamma 
interferon); Meng. et al., J. Chromatogr., 443:183-92 (1988) (Somatomedin C); Tsuji. et al.. Biochemistry, 26:3129-34 
(1987) (interleukin-2). The term "refractile" refers to the ability to observe these bodies by phase contrast microscopy. 
Frequently, this insoluble protein retains only a fraction of the expected biological activity, possibly due to incorrect folding. 

30 It has been suggested that inclusion bodies are formed when molecules of partially folded proteins interact with each 
other faster than they can fold into their native, active conformation. Kruger, et al., Biopharm. 40 (March 1989); Haase- 
Pettingell and King, J, Biol. Chem., 263: 4977-83 (1988). "Factors contributing to the formation of inclusion bodies in 
recombinant bacteria remain obscure and it is not easy to predict the physical state of the product of a newly expressed 
eukaryotic gene in E colt ." Petrov, supra . 

35 While the formation of these inclusion bodies results in enrichment of the recombinant protein, and is therefore 
sometimes desirable, it also necessitates solubilization of the aggregates and regeneration of the protein's biological 
activity. Petrov. suora at 4. comnronts, "sometimes these obstacles seem to be the most critical point of the recombinant 
technology." 

Attempts have been made to solubilize and renature these proteins. Wetzel. U.S. 4.599.197; Builder. U.S 4,620,948; 

40 Olson. U.S. 4.511,503; Jones, U.S. 4,512,922. However, such efforts can be laborious and uncertain of success. See 
Giantini and Shatkin. Gene, 56: 153-160 (1987), As stated by Weir and Sparks. Biochem, J.. 245: 85-91 (1987). "proteins 
vary considerably in their optimal conditions for renaturation; various factors such as pH, salt concentration and type, 
rate of removal of denaturant. concentration of the target protein and of contaminants may strongly affect the recovery 
of authentic protein." These conplications are avoided if the protein of interest is expressed in soluble form. 

45 Gatenby, etal., Eur. J. Biochem., 168:227-31 (1987) has discussed difficulties in the preparation of the higher plant 
enzyme ribulose-bisphosphate carboxylase. This enzyme has the subunit structure LeSs, where L is a large subunit and 
is a small subunit. In nature, a binding protein apparently maintains L in soluble form prior to assemtMy with S. Attempts 
to assemble an active higher plant RuBPCase in E coji have been frustrated by the formation of an insoluble, inactive 
aggregate of L. 

50 

H. Bacterial Expression of Human Alpha and beta Globins 

Nagai and Thorgerson (Nature, 309: 810-81 2. 1 984) e)?)ressed in E. coli a hybrid protein consisting of the 31 amino- 
terminal residues of the lambda cll protein, an lle-Glu-Gly-Arg linker, and the complete human beta globin chain. They 
55 cleaved the hybrid immediately after the linker with blood coagulation factor Xa, thus liberating the beta-globin chain. 
Later, Nagai, etal., PN.A.S. (U.S.A.), 82:7252-55 (1985) took the recombinant DNA-derived human beta globin. naturally 
derived human alpha globin, and a source of heme and succeeded in producing active human hemoglobin. Because 
the alpha globin was derived from erythrocytes, the final product may contain undesirable erythrocyte membrane con- 
stituents. 
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More recently, an efficient bacterial expression system for human alpha gtobin was reported. GB 8711614, filed 
May 16, 1987; see copending Ser. No. 07/194,338 and WO 88/09179. This led to the production of wholly synthetic 
human hemoglobin by separate expression of the insoluble globin subunits in separate bacterial cell lines, and in siJu 
refolding of the chains in the presence of oxidized heme cofactor to obtain tetrameric hemoglobin. This procedure is 

5 laborious and low in yield. It requires the use of denaturing solvents (urea and guanidine). and chemical reduction of 
ferric ion to the ferrous state (see example). One object of the present invention is to overcome these disadvantages. 

While human alpha and beta globins may be expressed separately in E. coli . Walder. Proceedings. Biotech USA 
1988 (San Franciso, Nov. 14-16, 1988) warns at page 360, "isolated alpha and beta [globin] chains are unstable and 
tend to precipitate." If human alpha and beta globin are not produced in soluble form, they must be solubilized with 

10 denaturing agents and then refolded to restore activity. Moreover, when a wild-type alpha globin gene is expressed in 
E cell, alpha globin accumulates only slowly It is not certain whether this is due to inefficient translation or to the action 
of host proteases, but WO 88/091 79 teaches that this problem may be overcome by fusing a short section of the beta 
globin gene to the alpha globin gene, so that a hybrid protein is produced. This hybrid protein must then be cleaved, 
e.g. . with a protease, to release the globin. ff the protease is not completely selective (perhaps because of contamination 

15 by other proteases), the desired cleavage product may not be the only one obtained. In any event, that product must be 
separated from other E coli polypeptides, and any contaminants associated with the protease 

Sperm whale myoglobin has been expressed in E, coli, demonstrating that bacteria can incorporate prosthetic heme 
groups into a protein expressed from a cloned eukaryotic gene. Springer and Sligar. PNAS (USA) 84: 8961 -65 (1987). 
Walder says, "it remains to be seen if hemoglobin can be similarly made if txith the alpha and beta chains are expressed 

20 within the same cell." While there is a high degree of tertiary structure similarity t)etween myoglobin (a single chain 
protein) and the individual alpha and beta globin subunits. hemoglobin is a heterotetrameric protein, the primary globin 
sequences have no more than a 27% homology and myoglobin is now known to enjoy significantly higher stability than 
either alpha or beta globin. Thus, it could not be predicted that co-expression of alpha- and beta-globin in the same cell 
would result in intracellular assembly of a functional hemoglobin, which requires proper folding of the alpha and beta 

25 chains and incorporation of heme. 

I. Human Gene Expression in Yeast, Generally 

A number of human proteins have been expressed in transformed yeast cells, especially Saccharonwces cerevisiae . 
30 either cytoplasmically or by secretion into the culture medium. King, etal., Biochem.Soc. Transac, 16:1083-1086(1988). 
But success is not guaranteed. Thim, el al., FEBS Lett., 212:307-312 (1 987) experienced difficulty In obtaining properly 
crosslinked insulin from yeast cells in which the intact proinsulin-encoding gene had been inserted. They overcame this 
problem by constructing a modified proinsulin gene which encoded the B and A chains linked by a hexapeptide spacer. 
The product of this gene was cleaved and the two chains were properly folded and crosslinked by the cells. 
35 Richardson, et al.. Biochim. Biophys. Acta, 950:385-94 (1988) expressed the B chain of the heterodimeric protein 
ricin in E. coli . They reported that it was hard to obtain high levels of secretion of a yeast alpha factor leader/ricin B chain 
fusion protein. No attempt was made to co-express and assemble the ricin A and B chains. 

Murakami, et a!., DNA, 6:189-97 (1987) reported production of a heme-containing fused enzyme in transformed 
yeast cells. 

40 Honwitz, et al., PNAS (USA). 85:8678-82 (Nov. 1988) described the construction of yeast strains which secrete 
functional mouse variable region/human IgGi constant region chimeric antibodies into the culture medium. They char- 
acterize their paper the first report of the secretion of a foreign multimeric or heterodimeric protein in yeast. But see also 
Carlson, Mol Cell. BioL. 8:2638-46 (June 1 988)., showing transcription and translation of heavy and light-chain cDNAs 
into polypeptides which associate and bind antigen. 

45 Beggs, et al.. Nature. 283:835 (1980) attempted to express a chromosomal rabbit beta globin gene in S. cerevisiae . 
However, these yeast cells were unable to properly splice the intron-containing globin mRNA transcript. 

No admission is made that any reference cited herein is prior art. The description of the work and the citation of 
publication date are based solely on the published information and the applicants reserve the right to question the 
accuracy of that information. 

50 

SUMMARY OF THE INVENTION 

It is the object of this invention to overcome the aforementioned deficiencies of the prior art. For example, Applicants 
have achieved the first complete expression and assembly of tetrameric hemoglobin in cells which do not produce hemo- 
55 globin in nature. Prior work has related to the separate expression of alpha and beta globin and their extracellular com- 
bination with heme to form hemoglobin. 

A central feature of the present invention is the intracellular assembly of human alpha- and beta-globin-like polypep- 
tides and intracellular incorporation of heme to form a biologically functional hemoglobin-like protein. This intracellular 
assembly is achieved by expressing the alpha and beta-globin-like polypeptides in the same cell so that fold together 
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and incorporate heme. An important characteristic of this invention is a substantia! reduction the formation of insoluble 
globin inaggregates, in particular of beta globin, as compared to what is observed when globins are separately expressed 
in CQli or S. cerevisiae . Co-expression may be achieved from genes on two separate but compatible plasmids in the 
same cell, or from two different operons on the same plasmid. or from a single polyclstronic operon. 

In one embodiment, the human alpha- and beta-globin-like polypeptides are co-expressed in bacterial cells. The 
corresponding genes may be included in the same synthetic operon (i .e., driven by one promoter), or placed in separate 
operons with separate promoters (which may be the same or different). Preferably, expression of the alpha- and beta- 
globin is enhanced by placing a "ribosomal loader" sequence as hereafter described before each globin gene. This is 
particularly advantageous in the case of alpha globin which is more difficult to produce in quantity. 

In another embodiment, the human alpha- and beta-globin-like polypeptides are co-expressed in yeast cells. 
Improvements in both the yield of the alpha globin and the solubility of beta globin are obtained. 

A further aspect of the invention is the production of novel intermediates, human di-alpha globin and human di-beta 
globin (and mutants thereof), which can be expressed in a cell and assembled with each other or with beta- or alpha- 
globin-like polypeptides, respectively, into a human hemoglobin -like protein. While intracellular assembly is not strictly 
required, di-alpha and di-beta globin may be considered specially adapted to intracellular assembly of a functional hemo- 
globin since expression of, e.g., a di-alpha globin is analogous in some respects to intracellular assembly of two alpha 
globin sub-units, differing from assembly as previously discussed in that the association is accomplished by expression 
of a covalent peptide linker rather than by noncovalent interaction of the subunlts. Di-alpha and Dl-beta-globln-like 
polypeptides may be expressed in, preferably, bacterial cells or in yeast cells. 

These facets of the Invention are now discussed In greater detail. 

Yeast Expression of HemoqIobln-LIke Proteins 

Applicant have discovered that it Is possible to produce human hemoglobin (or mutants thereof) in yeast, especially 
Saccharomyces cerevisiae . The use of a yeast expression system obviates the need to separate the hemoglol>in from 
bacterial endotoxins. We have also found that alpha and beta globins with the correct N-terminal amino acid may be 
obtained without first expressing the globin as a part of selectively cleavable fusion protein. We believe that this is 
because the yeast enzyme methionyl aminopeptldase should be capable of removing the N-terminal methionine from 
Met-alpha-globin and Met-beta-globin to expose the desired N-terminal amino acid (Valine) . Production of altered oxygen 
affinity mutants as discussed In W08d/091 79 Is of special Interest. Such mutants may be produced by site-specific 
mutagenesis of globin genes followed by cloning and expression In yeast. 

In a preferred embodiment, expression is controlled by a "gal-gap49" hyt)rld promoter as hereafter defined. 

Co-Expression of Alpha and Beta Globin Genes in Yeast Cells 

In a preferred embodiment, the alpha and beta globin genes are both expressed within the same yeast cell. Expres- 
sion of the beta globin gene alone results in the production of beta globin as a largely insoluble, difficult-to-extract protein 
comprising less than 2% of the total cell protein. Expression of the alpha globin gene alone results in production of alpha 
globin at very low levels (under 0.5% of the total cell protein). In neither case is heme incorporated. When, however, the 
alpha and betaglobm genes are co-expressed, the transformed yeast cells fold the alpha and beta globin chains together 
and incorporate heme groups to obtain functional recombinant human hemoglobin m soluble form, accumulating to 
about 10% of the total cell protein, without any change in the promoters operably linked to the genes. 

The alpha and beta globin genes may. in turn, be carried on different plasmids or on the same plasmid within the 
host cell. 

Polycistronic Go- Expression of the Al pha and Beta Globin Genes in Bacterial Cells. 

Applicants have translationaily coupled alpha and beta globin genes to a small "ribosomal loader" gene encoding 
a small consumable peptide that will lead the ribosome directly Into the ATG of the desired alpha and beta globin message 
and thus enhance transiatlonal efficiency The have also placed the alpha and beta globin genes in the same operon 
so they are transcribed into a single polycistronic mRNA transcript. The globins are then translated as separate polypep- 
tide chains which subsequently are folded together and joined with intracellular heme by transformed cells to form the 
hemoglobin tetramer. Applicant's method overcomes the problem associated with separate purification of precipitated 
alpha and beta globins. . 

The polycistronic expression and assembly of a heterooligomeric human protein in soluble, active form in a heter- 
ologous host has not been previously reported. It Is especially noteworthy that this was a mammalian protein expressed 
In a prokaryotic (bacterial) host. It should further be considered that this protein Incorporates prosthetic groups, which 
add a further complication to the goal of proper post-translational processing. 
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In one embodiment. Met-FX -alpha globin and Met-FX-beta globin are co-expressed, where FX denotes a leader 
peptide which a recognition site for Factor Xa cleavage activity. FX-alpha globin and FX-beta globin assemble to form 
a mutant hemoglobin with reversible oxygen binding activity, albeit higher in affinity for oxygen than native hemoglobin. 
Alternatively, the FX leader, or other fused leader, may be cleaved to obtain a duplicate of native Hgb. 

In another embodiment. Met-alpha globin and Met-beta globin are co-expressed. This eliminates the need for a 
cleavage step. 

In a third embodiment, des-val-alpha globin and des-val beta globin are co-expressed. Native alpha and beta globin 
both begin with valine. The valine may, however, be replaced with methionine, which is of similar hydrophobicity 

In further embodiments, one or more codons of the native genes are altered so that a alpha and/or beta globin- 
related protein characterized by one or more amino acid differences (insertions, deletions or substitutions) from the 
native species Is produced. 

Pi -Alpha and Pi-Beta Globins 

A new protein, di-alpha globin, has been prepared, which consists of two alpha globin amino acid sequences cov- 
alently connected by peptide bonds, preferably through an intermediate linker of one or more amino acids. Surprisingly, 
these "genetically fused " alpha globin chains were capable of appropriately folding and combining with beta globin and 
heme to produce functional hemoglobin analogue. The term "genetically fused" refers to the method of production. Two 
copies of the globin gene are fused together, preferably with a spacer PNA encoding the amino acid linker, so the 
construct directly encodes the desired di-alpha globin. The term "analogue" is used because in native hemoglobin, the 
alphal and alphas subunits are noncovalently bound. The analogous preparation of di-beta globin is also envisioned. 

The preparation of "genetically fused" hemoglobins avoids the disadvantages of chemical crosslinking. The latter 
is inefficient and often requires deoxygenation of the hemoglobin solution and the presence of another molecule (e g., 
inositol hexaphosphate or 2.3-DPG) to prevent competing reactions. 

In a prefen*ed embodiment, the di-alpha globin and/or the beta globin contain mutations which reduce the oxygen- 
binding affinity of the henrwglobin analogue in solution so as to approach the oxygen-binding characteristics of whole 
Wood. 

The di-alpha hemoglobin advantageously exhibits a substantially longer half-life in the circulatory system than does 
conventional (des-val) recombinant hemoglobin. Preferably, in humans, the half-life exceeds 9 hours at a dose of at least 
1 gm/Kgm body weight. This would be expected to correspond to a half-life of about 3 hours in rats given a comparable 
dose. 

The di-alpha and di-beta globins can be expressed in both bacteria and yeast. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 : Flowchart for construction of plasmids for expression of FX-alpha globin (pDL ll-62m), FX-beta globin 
(pDL 11-1 Oa), and FX-hemoglobin (pDL ll-66a) are schematized. 

Figure 2 : Flowcharts for construction of plasmid pDL 1 11-1 a (2a) bearing drcistronic Pes-Val -Alpha globin gene under 
control of Tac promoter, and polycistronic di-alpha/beta co-expression plasmid pDL lll-47a (2b). 

Figure 3 : Flowchart for construction of plasmid for co-expression of Met-alpha and Met-beta globins, pDL 111-1 3e. 

Figure 4 : Oligonucleotides for construction of synthetic FX-alpha and FX-beta globin genes. The top strand is shown 
5' to 3' and the bottom strand as 3' to 5'. Areas of overlap between complementary synthetic oligonucle- 
otides are shown as areas where both strands are shown in the same case letters. The EsU site that joins 
FX-alpha and FX-beta occurs at the overlap of SJH l-35a and SJH l-36b. 

Figure 5 : Synthetic gene for expression of Met-FX-alpha and Met-FX-beta globin. Region A contains the alpha 
globin gene and region B the beta globin gene. The location of the Factor X sequence and the two Shine- 
Delgarno sequences (SD#1 and SD#2) in both regions is indicated. Selected restriction sites are also 
found. The translated amino acid sequences for tiie ribosomal loader and Met-FX-alpha/and beta-globin 
are given. 

Figure 6 : Elution profile and absorbance spectrum for FX-hemoglcbin. 



Figure 7 : Oligonucleotides for construction of mutant hemoglobins (differing in amino acid sequence from conven- 
tional hemoglobin). 
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Figure 8 : Oligonucleotides for construction of plasmids which do not encode the Factor Xa substrate recognition 
site. 

Figure 10 : Oxygen Binding of Des-Fx Hgb 
Figure 11 : Plasmids pDL III-14C and pDL lll-38b 

Fioure 1 2 Shows the sequence of a preferred synthetic gene for expression of (des-Val)-alpha-(GlyGly)-alpha and 
des-Val beta globin. A shows the region (EcoRI to PatI) containing Shine-Deigarno ribosomal binding 

10 sites (SD#1 and SD#2), the sequence expressing the octapeptide (Met. .Glu) which serves as a cotrans- 

lational coupler, and the sequence encoding the two nearly identical alpha globin -like polypeptides and 
the interposed Gly-Gly linker. The first alpha globin sequence begins 'Met-Leu", that is, it contains an 
artifactual methionine, omits the valine which is the normal first residue of mature alpha globin, and con- 
tinues with the second residue, leucine. The second alpha globin sequence begins "Val-Leu", immediately 

15 after the underlined "Gly-Gly" linker. Start and stop codons are underlined. B shows the analogous region 

(PstI to HIndlll) containing the coding sequence for des-Val beta globin. A and B are connected at the 
PstI site to form a single polycistronic operon. 

Figure 13 : Shows the structure of the final expression vector pDL lll-47a. "PTac" is the Tac promoter, and "ampicillin" 
20 is the ampicillin resistance gene. Figure 1 3a shows an Xbal-BamHI fragment of PDL lll-47a. 

Figure 14 : Plasmid pSGE0.0E4 

RourelS : Plasmid pSGEI . 1 E4 

Figure 16 Plasmid pDL IV-67a 

Fi gure 17 Plasmid pJR Vl-54a 

Figure 18 Plasmid pDL di-alphaAsetaADeta 

Fioure 19 Flowchart showing the construction off various expression vectors featuring lambda Pl regulation of var- 
ious polycistronic globin operons. 

Figure 20 Nucleotide Sequence of GAL-GAP promoter, with restriction sites indicated. The region from SphI to 
EcoRV contains a synthetic GALi_io regulatory region (M. Johnston and R. Davis. 1984. Molecular and 
Cellular Biology, 4:1440-1448). The UAS is in the region numbered 29-63 on this Figure. The region from 
EcoRV to the Xbal site contains the consensus GAP491 transcriptional start region, with the approximate 
start of transcription being at 395. (L McAlister and M.J. Holland. 1983. J. Biol. Chem., 260:15019-15027: 
J. Holland, et al. 1983. J. Biol. Chem., 258:5291-5299.) 

Figure 21 Flowcharts showing construction of beta-globin expression cassette (21a and 21b). 

F igure 22 Flowcharts showing construction of beta-giobin expression vector pGS4988 (22a) and alpha>globin 
expression vector pGS4688 (22b). 

Figure 23 Flowcharts (23a and 23b) showing construction of an alpha^lobin expression cassette and of vectors 
pGS289 and pGS389 for co-expression of alpha- and beta globin from the same plasmid. Note that alpha 
and beta globin are expressed from separate promoters. 

Figure 24 : Absorbtion spectra for yeast-produced recombinant arxJ native human hemoglobin. 

Figure 25 : Flowchart (25a) showing construction of di-alpha/beta hemoglobin yeast expression vector and map of 
plasmid pGS3089 (25b). 

Figure 26 : Map of plasmid pGS3089RGV desp. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The structure of coriventional hemoglobin is well known. We herewith incorporate by reference the entire text of 
Bunn and Forget, eds., Hemoglobin: Molecular. Genetic arxj Clinical Aspects (W.B. Saunders Co., Philadelphia, PA: 
5 1986) and of Fermi and Perutz "Hemoglobin and Myoglobin," in Phillips and Richards. Atlas of Molecular Structures in 
Biology (Clarendon Press: 1981). 

The primary structure of a polypeptide is defined by its amino acid sequence and by identification of any modifications 
of the side chains of the individual amino acids. 

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2, because it comprises two 
10 alpha and two beta chains). The alpha chain consists of 141 amino acids (See Figure 11). The iron atom of the heme 
(ferroprotoporphyrin IX) group is bound covalently to the imidazole of his 87 (the "proximal histidine"). The beta chain is 
146 residues long (see Figure 1 2) and heme is bound to it at his 92. 

Other recognized hemoglobin species are Hgb A2 (02 a2 62). Hgb A^g, Hgb A^b, and Hgb A^c. as well as the rare 
species Hgb F (a2 gamma2). Hgb Gower-1 (Zeta2 epsilon2), Hgb Gower-2 (alpha2 epsilon2). Hgb Portland (Zeta2 
15 gamma2), and Hgb H (beta4) and Hgb Bart (gamma4). They are distinguished from Hgb A by a different selection of 
polypeptide chains. 

Segments of polypeptide chains may be stabilized by folding into one of two common conformations, the alpha helix 
and the beta pleated sheet. In its native state, about 75% of the hemoglobin molecule is alpha-helical. Alpha-helical 
segments are separated by segments wherein the chain is less constrained. It is conventional to identify the alpha-helical 

2G segments of each chain by letters, e.g., the proximal histidine of the alpha chain is F8 (residue 8 of helix F). The non- 
helical segments are identified by letter pairs, indicating which helical segments they connect. Thus, nonhelica) segment 
BC lies between helix B and helix C In comparing two variants of a particular hemoglobin chain, it may be enlightening 
to attempt to align the helical segments when seeking to find structural homologies For the amino acid sequence and 
helical residue notation for converrtional human hemoglobin A^ alpha and beta chains, see Bunn and Forget, supra , and 

25 Table 1 herein. 

The tertiary structure of the hemoglobin molecule refers to the steric relationships of amino acid residues that are 
far apart in the linear sequence, while quaternary structure refers to the way in which the subunits (chains) are packed 
together. The tertiary and quaternary structure of the hemoglobin molecule have been discerned by X-ray diffraction 
analysis of hemoglobin crystals, which allows one to calculate the three-dimensional positions of the very atoms of the 
30 molecule. 

In its unoxygenated ("deoxy", or "T" for "tense") form, the subunits of hemoglobin (alphal , alpha2, betal , and beta2) 
form a tetrahedron having a twofold axis of symmetry. The axis runs down a water-filled "central cavity". The subunits 
interact with one another by means of Van der Waals forces, hydrogen bonds and by ionic interactions (or "salt bridges"). 
The alphal beta 1 and alpha2beta2 interfaces remain relatively fixed during oxygenation. In contrast, there is considerable 
35 flux at the alpha 1 beta2 (and alpha2 betal ) interface. In its oxygenated ("oxy", or "R" for "relaxed" form), the intersubunit 
distances are increased. 

The tertiary and quaternary structures of native oxyhemoglobin and deoxy hemoglobin are sufficiently well known 
that almost all of the nonhydrogen atoms can be positioned with an accuracy of 0.5 A° or better. For human deoxyhe- 
moglobin. see Fermi, et al., J. Mol. Biol., 175: 159 (1984), and for human oxyhemoglobin, see Shaanan, J. Mol. Biol., 

40 171 : 31 (1 983), both incorporated by reference. 

While analyses of hemoglobin structure tend to focus on the alpha-beta interfaces, it is known that the distance 
between the amino terminus of one alpha subunit and the carboxyl terminus of the other is about 5.6 A° in the deoxy 
configuration and 3.3 A° in the oxy configuration. 

For the purpose of the appended claims, a hemoglobin-like protein Is an oxygen binding protein with a plurality of 

45 heme prosthetic groups and a P50 of at least about 6 torr (when measured as for Table 1 of Ser. No. 07/194,338), and 
composed of a plurality of polypeptides each of which Is a human alpha (or dl-alpha) giobin-like or human beta (or di- 
beta) globin-like polypeptide. A human alpha globin-like polypeptide is native human alpha globin or a mutant thereof 
differing from the native sequence by one or more substitutions, deletions or insertions, while remaining at least 7S% 
homologous with humein alpha globin, and still capable of incorporating heme and associating with beta globin. A beta 

50 globin-like subunit is analogously defined. Subunits of animal hemoglobins or mutants thereof which are sufficiently 
homologous with human alpha or beta globin are ennbraced by the term "human alpha or beta globin-like polypeptide.'' 
For example, the subunits of bovine hemoglobin are within the scope of these terms. The alpha- and beta-globln-like 
polypeptides may be referred to collectively as "globins". 

For the purpose of the appended claims, a di -alpha globin-like polypeptide is one which consists essentially of two 

55 alpha-globin-like polypeptide sequences connected by peptide bonds between the normal C-terminus of the first alpha- 
globin-llke polypeptide and the normal N-terminus of the second alpha-globin-IIke polypeptide. These two sequences 
may be directly connected, or connected through a peptide linker of one or more amino acids. Alpha globin chains 
crosslinked at the N- and C-terminals other than by peptide bonds (e.g., by DIDS) are excluded. The di-alpha globin- 
like polypeptide must be capable of folding together with beta globin and Incorporating heme to form functional hemo- 
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globin-like protein. The di-beta globin-like polypeptide is analogously defined. The di-alpha and di-beta globin-like 
polypeptides may collectively be referred to as "pseudodimeric globin-like polypeptides". 

A "Met FX alpha globin" is an alpha globin-like polypeptide comprising an N-terminal methionine, a oligopeptide 
which acts as a recognition site for Factor Xa (e.g., Ile-Glu-Gly-Arg), and an alpha globin-like sequence (e.g.. Val-His- 
5 Leu-Thr-Pro . ) which may correspond to wild-type alpha globin or to a mutant thereof as taught herein. The term "Met 
FX alpha globin" is sometimes abbreviated as "FX alpha globin". "FX beta globin" is an analogously defined beta globin- 
like polypeptide. 

"Met-alpha globin" is an alpha globin-like polypeptide with an extra N-terminal methionine. The second amino acid 
is valine, which is the first amino acid of mature wild-type alpha globin Met-beta globin Is analogously defined. A "Des- 
10 FX alpha globin" gene (or "dFFX alpha globin") is a Met-alpha globin gene obtained by excising the FX codons from a 
Met-FX alpha globin gene. Note that "Met-Hgb" is used to refer to methionyl Hgb formed from methionyl-alpha globin 
and methionyl-beta globin. 

"Des-Val-alpha globin" (or "dVal alpha globin") is an alpha globln-like polypeptide wherein methionine is substituted 
for the valine which begins the sequence of mature wild-type alpha globin. Des-Val-beta globin is analogously defined. 

75 Des-Val-alpha/alpha globin (di-des Val alpha globin) is a "di-alpha globin" in which a "Des-Val-alpha" sequence is linked 
via an appropriate peptidyl linker to an alpha globin-like sequence which begins with Val. 

The alpha and beta globin-like chains need not correspond exactly in sequence to the alpha and be\a globins of 
"conventional" hemoglobin. Rather, mutations maybe introduced to alter the oxygen affinity or stability of the hemoglobin, 
or the ease of expression and assembly of the individual chains. By way of example and not limitation, several mutant 

20 hemoglobins have been prepared by the method of this invention. Guidance as to further mutations is provided by the 
copending application of Hoffman and Nagat. Ser. No. 07/194,338, incorporated by reference herein. 

The DNA sequences encoding the individual alpha (or di-alpha) and beta (or di-beta) globin chains may be of 
genomic, cDNAand synthetic origin, or a combination thereof. Since the genomic globin genes contains introns, genomic 
DNA must either be expressed tn a host which can properly splice the premessenger RNA or modified by excising the 

25 introns. Use of an at least partially synthetic gene is preferable for several reasons. Rrst, the codons er)coding the desired 
amino acids may be selected with a view to providing unique or nearly unique restriction sites at convenient points in 
the sequence, thus facilitating rapid alteration of the sequence by cassette mutagenesis. Second, the codon selection 
may be made to optimize expression in a selected host. For codon preferences in E. coli . see Kbnigsberg, et al., FN AS, 
80:687-91 (1983). For codon preferences in yeast, see the next section. Finally, secondary structures formed by the 

30 messenger RNA transcript may interfere with transcription or translation. If so, these secondary structures may be elim- 
inated by altering the codon selectbns. 

Of course, if a linker is used to genetically crosslink sut>unlts, the linker will normally be encoded by a synthetic 
DNA. While the di-alpha globin and the beta globin may be expressed separately and then combined with each other 
and heme in vitro, they are preferably placed on one plasmid. 

35 The present invention is not limited to the use of any particular host cell, vector, or promoter. However, the preferred 
host cells are bacterial (especially, eoil) and yeast (especially cerevisiae) cells. The promoter selected must be 
functional in the desired host cells. It preferably is an irxJuciWe promoter which, upon induction, provides a high rate of 
transcription. Apreferred fc>acterial promoter is the Tac promoter, a trp/lac hybrid described fully in DeBoer, U.S. 4.551 ,433 
and commercially available from Pharmacia-LKB Other promoters which might be used include the temperature sen- 

40 sitive lambda Pl and Pr promoters, as well as the lac. trp, trc, pIN (lipoprotein promoter and lac operator hybrid), gal 
and heat shock promoters. The promoter used need not be identical to any naturally-occurring promoter. Guidance for 
the design of promoters is provided by studies of promoter structure such as that of Harley and Reynolds, Nucleic Acids 
Res., 15:2343-61 (1987) and papers cited therein. The location of the promoter relative to the first structural gene may 
be optimized. See Roberts, et al., PNAS (USA), 76 760-4 (1979). The use of a single promoter is favored. Suitable yeast 

45 expression systems are described in detail elsewhere in this specification. 

The vector used must be one having an origin of replication which is functional in the host cell. It desirably also has 
unique restriction sites for insertion of the globin genes and the desired regulatory elements and a conventional selectable 
marker. A vector may be nxxjified to introduce or eliminate restriction sites to make it more suitable for futher manipu- 
lations. 

50 The alpha and beta globin chains may be expressed either directly or as part o# fusion proteins. When expressed 
as fusion proteins, the latter may include a site at which they may be cleaved to release the alpha and beta globin free 
of extraneous polypeptide. If so, a site sensitive to the enzyme Factor Xa may be provided, as taught in Nagai and 
Thorgenson. EP App1 161.937, incorporated by references herein. Alternatively, the alpha and beta fuswn proteins may 
be synthesized, folded and heme incorporated to yield a henx)globjn analogue. 

55 The direct expression of the alpha and beta globin subunits is desirable. Factor Xa is a blood derivative. Preparations 
of Factor Xa may therefore contain undesirable Wood-associated sut5Stances oretiologic agents. In any event, the hemo- 
globin must be separated from the Factor Xa. 

Nagai and Thorgerson, EP AppI 161 ,937, incorporated by reference herein, teach the construction of fused genes 
in which DNA coding for a polypeptide of interest is immediately preceded by DNA encoding a cleavage site for Factor 
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Xa, a serine protease. Certain of the peptide sequences to be cleaved by Factor Xa are quoted below (wherein the 

cleavage site is denoted by an "="): 

lle-Glu-Aly-Arg=Val-His-Leu-Thr CM FxB-globin 

Me-Glu-Gly-Arg=Thr-Ala-Thr-Ser Hu prothrombin 

lle-Glu-Gly-Arg=Thr-Ser-Glu-Asp Bo prothrombin 

lle-Asp-Gly-Arg=lle-Val-Glu-Gly Hu prothrombin 

lle-Glu-Gly-Arg=lle-Val-Glu-Gly Bo prothrombin 

Ala-Glu-Gly-Arg=Asp-Asp-Leu-Tyr Hu antithrombin III 

In the above list. "CIIFXp-globin" refers to a hybrid fusion protein comprises the 31 amino-terminal residues of 
the lambdaCII protein, the Factor Xa recognition sequence "lle-Glu-Gly-Arg, " and the complete amino acid sequence 
of human beta globin (which begins "Val-Hts-Leu-Thr-..."). It will be evident from study of Figure 4 of the present invention 
that FX-alpha and FX-beta globins of Example 1 correspond to the native globin preceded by "Met-lte-Glu-Gly-Arg ' 

In bacterial mRNA, the site at which the ribosome binds to the messenger is a polypurine stretch which lies 4-7 
bases upstream of the start (AUG) codon. The consensus sequence of this stretch is 5'... AGGAGG...3'. and is frequently 
referred to as the Shine-Dalgarno sequence. Shine and Dalgarno, Nature, 254; 34 (1975). The exact distance between 
the SD sequence and the translational start codon, and the base sequence of this "spacer" region, affect the efficiency 
of translation and may be optimized empirically. Shepard, et al., DNA 1 : 125 (1985); DeBoer, et a!., DNA 2: 231 (1983); 
Hui, etal.. EMBO J., 3: 623 (1984). 

In addition, the SD sequence may itself be modified to alter expression. Hui and DeBoer, PNAS (USA), 84:4762- 
66 (1987). Corrparative studies of ribosomal binding sites, such as the study of Scherer, et al.. Nucleic Acids Res., 
8:3895-3907 (1 907), may provide guidance as to suitable base changes. If the hemoglobin is to be expressed in a host 
other than E eoli, a ribosomal-binding site prefen-ed by that host should be provided. Zaghbil and Doi, J. Bacteriol., 
168:1033-35(1986). 

Any host may be used which recognizes the selected promoter and ribosomal binding site and which has the capa- 
bility of synthesizing and incorporating heme. Bacterial and yeast hosts are preferred. 

The intracellularly assembled hemoglobin may be recovered from the producing cells and purified by any art-rec- 
ognized technique. 

Polvcistronic Co-Expression of Alpha and Beta Globins and Their Assembly Into Hemoglobin 

In one embodiment, expression of the alpha and beta globin genes is driven by a single promoter, and the genes 
are arranged so that a polycistronic messenger RNA transcript is transcribed, from which the separate alpha and beta 
globin polypeptides are subsequently translated. However, the present invention includes the co-expression of the alpha 
and beta globin genes from separate promoters, i.e.. the host transcribes separate alpha and beta globin mRNAs. 

The use of a single promoter is favored on theoretical grounds. Ideally, alpha and beta globin are expressed in 
stoichiometrically equal amounts. While use of a single promoter does not guarantee equality, it eliminates one unbal- 
ancing influence - differences in transcription owing to differences in promoter strength and accessibility. If differences 
in promoter strength were minimized by use of two identical promoters on the same plasmid, plasmid stability would be 
reduced as there would be a propensity toward recombination of the homologous regions. We note, however, that in 
preliminary experiments we have co-expressed alpha and beta globins from separate promoters. 

Another justification for using a single promoter is to minimize the number of repressor binding sites. 

Preferably, the alpha and beta globin genes are arranged so that the ribosome will translate the alpha globin cistron 
first The rationale is that there is some basis for t>elieving that alpha globin affects the folding of beta globin. Nonetheless, 
the position of the genes may be switched so that beta globin is synthesized first, as is shown in Example 6. 

The stability of the polycistronic mRNA transcript, the efficacy of its translation into alpha and beta globin, and the 
folding of the globin chains into tetrameric hemoglobin may be modified by varying the length and base sequence of the 
intercistronic regions (the region lying between the stop codon of one cistron and the start codon of the next cistron). 
the phasing of a second cistron relative to a first cistron, and the position and sequence of the ribosomal birxJing site 
for the one cistron relative to the preceding cistron. 

In a prefen-ed embodiment, the alpha and beta globin genes are each preceded by a short "introductory" cistron or 
"ribosomal loader" which facilities the subsequent translation of the globin cistron. In Figure 4, region A contains two 
cistrons and a Shine-Delgarno sequence preceeding each cistron. The first Shine-Delgarno sequence (S0#1) is bound 
by the ribosome, which then translates the first cistron. a short cistron encoding an octapeplide. (This cistron is referred 
to as an "introductory cistron or ribosomal loader.) The second cistron is a globin gene. In this case, an FX alpha-gtobin 
gene. The Shine-Delgarno sequence (SD#2) for facilitating translation of the second cistron actually lies within the first 
cistron. For this reason, the two are said to be Iranslationaily coupled". Region B is identical in structure, except that 
the second cistron encodes FX-beta globin. Between regions A and B is a 43-base intercistronic region. The Introductory 
cistrons of regions A and B correspond to the first cistron of the two-cistron expression system denoted pCZ144 in 
Schoner, et al., Meth. Enzymol.. 153: 401-16 (1987). The present invention is not, however, limited to the particular 
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"starter" cistron taught by Schoner, et al.; other introductory cistrons that allow for restart of high level translation of a 
following cistron may be used. 

Guidance as to the design of intercistronic sequences and as to the location of SD sequences may be obtained by 
comparing the translational efficiency of spontaneous or controlled nfiutants of the same polycistronic operon. as exem- 

5 plified by Schoner. et al., PNAS. 83: 8506-10 (1 980). It is also possible to look for consensus features in the intercistronic 
regions of different operons. McCarthy, et al., EMBO J., 4: 519-26 (1985) have identified a translation-enhancing inter- 
cistronic sequence in the E. coli atp operon. 

The present invention is intended to reduce or avoid the localization of the hemoglobin or its component polypeptides 
into inclusion bodies. Consequently, a further feature of the invention is that the functional hemoglobin is substantially 

10 found (preferably over 80%) in the soluble fraction of the cell. It appears that with this invention, over 907o of the functional 
hemoglobin can be so directed when alpha2 beta2 hemoglobin is assembled from alpha-and beta-globin chains coex- 
pressed from a tetracistronic operon as described herein. With di-alpha, beta2 hemoglobin, nearly 1 00% is soluble when 
expression is induced at 25'C and less at higher induction temperatures. These percentages reflect the percent of all 
di-alpha and beta chains found in the soluble fraction of the cell and not actual recovery of protein from the cell. 

15 

Expression in Yeast 

In another embodiment the present invention relates to the production of hemoglotDin-like molecules in yeast. Our 
preferred host for expression of recombinant human hemoglobin in yeast is Saccharomyces cerevisiae . However, other 

20 fungi or yeast may be used for the purpose, such as strains of Aspergillus or Pichia . For yeast to be a suitable host it 
must be capable of being transformed with recombinant vectors, either replicating or integrating types. This allows the 
insertion of the desired DNA sequence for the gene of interest. It must also be capable of high density cell growth, in 
appropriate volume to provide sufficient cell mass to isolate the desired gene product from the desired reaction vessels, 
where ideally the growth would be easily controlled by several parameters including nutrient formulation, agitation and 

25 oxygen transfer and temperature. It is also desirable to be able to induce the expression of protein synthesis with the 
manipulation of the media, temperature, or by the addition or consumption of certain chemicals. Finally, to be a suitable 
host, the yeast must be capable of producing recombinant proteins, preferably in excess of 1% of the total cell protein. 
This allows more faciie isolation of the desired recombinant protein. 

With reference to S. cerevisiae. haploid strains of potential use include: 

30 



BJY3501 


Mata pep4::HIS3 prbl-A 1 .6R his3 200 ura3-52 cani GAL, 


GSY112 


As above but Leu2::HISQ 


GSY112clr 


As above but cured of 2\l plasmid 


BJY3505 


Mata pep4::HIS3 prbl-A 1.6RHIS3 Iys2-208trpl-101 ura3-52 gal2can1 


35 GSY113 


As above but Ieu2: HISG 


RSY330 


Matapep4-3 prb1-1 122 hist? ura3-52 trpi -289 cani gall 


BJY2168 


Mata prcl-407 prt)1-1 122 pep4-3 Ieu2 trp1 ura3-52 


BJY1991 


Mata prbM 122 pep4-3 Ieu2 trpi ura3-52 


RSY334 


Mata regl-501 pep4-3 pit)1-1122 ura3-52 leu2-3. 112 gall 


40 RSY214 


Mata pep4-3 prbl ura3-52 



To date, strains such as GSY1 1 2. GSY1 1 3 and RSY334 have been the best hemoglobin producers. Strains such 
as RSY334 that carry the regl-501 mutation may be particularly important as they uncouple glucose repression from 
galactose induction, allowing one to induce with lower levels of galactose in the presence of glucose. Because this strain 
45 carries the gall mutation it cannot metabolize galactose, so the galactose concentration remains constant and continues 
to function as a inducer. 

Diploid streiins formed from crosses of any of the above compatible strains or similar compatible strains may also 
be useful as they tend to have faster growth rates than haploid strains. 

For example, the following diploid strains, which coexpress alpha (or di-alpha) and beta globins, are described in 
50 the Examples: 

BJY350 [pGS4988] x RSY330 [pGS4688] 
BJY3505 [pGS4988] x BJY 1991 [pGS4688] 

Other matings may likewise be used In practicing the present invention. 
The use of protease<feficient strains may also be advantageous. 
55 Yeast expression systenns can be divided into two main categories: (1) Systems designed to seaete protein and 
(2) system designed for the cytoplasmic expression of proteins. The advantages of secretion systems are: 

(1) The protein is often easier to purify from culture medium than from total cell extracts. 
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(2) If the proteins has essential disulfide bonds, they are more likely to form if theprotein passes through the secretory 
pathway. This is thought to be partly due to the presence of protein disulfide isomerase in the endoplasmic reticulum 
and to a less reducing environment than in the cytoplasm, 

5 (3) Secretion can also be advantageous if the first amino acid (after the initiating methionine) creates a dipeptide 
sequence that is poorly processed by methionyl aminopeptidase. The addiction of a secretory signal sequence may 
allow processing by signal peptidase during secretion resulting in a protein with the authentic amino acid at the 
amino terminus of the protein. 

10 The disadvantages of a secretion system are: 

(1) Generally, the expression level are much lower than those that can be obtained by cytoplasmic expression. This 
seems to be, in part, due to a rate limiting step in secretion. 

15 (2) Not all proteins are secretable. 

(3) often, particularly with S. cerevisiae, misprocessed forms of the protein accumulate and these can be difficult to 
purify away from correctly processed forms. 

20 The most commonly used secretory expression system used in yeast is the yeast a-factor secretory signal 
sequences and the a-factor promoter. See A.J. Brake. Yeast Genetic Enoineering Eds. P. Barr, A. Brake, and P. Valen- 
zuela. Butterworth Publishing, Boston 1989, for a review. The invertase signal sequence coupled to a variety of promoters 
has also been used to express heterologous proteins in yeast. See G. Stetler et al., Biotechnology. 1:55-60 (1989). R. 
Smith etal., Science, 229 1219-1229(1985). 

25 The advantages of cytoplasmic expression are: 

(1) Expression levels can be quite high with reports of proteins expressed at >20% of the total cell protein, usually 
as a soluble protein. 

30 (2) Some proteins cannot be efficiently secreted. 

(3) Proteins that contain glycosylation sites, if secreted from yeast, will be glycosylated with the pattern of sugars 
unique to yeast. Because these highly hydrophilic, post-transiational modifications lie on the surface of proteins they 
are likely to confer antigenicity on the protein. If the protein is functional without the carbohydrate side chains it may 

35 be advantageous to produce the protein without them rather than with the yeast-spedfic modification (see for ex. 

Travis et al. 1985. J Biol Chem 260 4384^389). 

(4) Some proteins have other specific modifications that may occur only in the cytoplasm, for example aminoterminal 
acetylation, rrxxJif ication with lipids, perhaps heme incorporation and so forth. 

40 

At present, cytoplasmic expression is preferred since the yeast cells fold together the globin chains and incorporate 
heme to produce hemoglobin in vivo . However, it is possible to separately express and secrete the alpha and beta globin 
chains and assemble hemoglobin in vitro . 

The globin genes must be placed under the control of a suitable promoter. The commonly used yeast promoters 

45 generally fall into two broad categories: regulated and constitutive. Constitutive promoters that are in wide use include 
GAP. PGK (phosphoglycerate kinase) and the a-factor promoter. Regulated promoters have also been used and these 
include the yeast metallothionein pronwter (regulated by copper), the Gall -10 promoter. GAL7 promoter (regulated by 
galactose and glucose) the ADHII promoter (regulated by ethanol and glucose) the PH05 promoter (phosphate regula- 
tion) and several hybrid promoters such as PH05-GAP. GAL-PGK. ADHII-GAP, and GAL-CYC1. 

50 The use of regulated promoters may be important for plasmid stability Often expression of recombinant proteins, 
at high levels, inhibits the growth of the host organism. This disadvantage can result in the accumulation of cells in the 
population with few copies of the plasmid preserrt. a decrease in growth rate and a drop in overall expression levels. By 
maintaining the gene in a repressed state and inducing late in the fermentation, high growth rates can be maintained 
and selection against high plasmid copy numbers can be avoided. 

55 It is somewhat difficult to obtain accurate data on the relative strength of yeast promoters because the only true 
measure of this would be based on kinetics of mRNA synthesis. Most of the data that exists measures only the final 
protein concentration that has been obtained. Because there can be great differences in protein stability, it Is necessary 
to compare the same protein and mRNA with multiple promoters on identical vectors. A study that approaches this was 
done by Verbakel filjaL (Gene. £1:207-215 (1987)). They compared the GAPDH. CYC1 GAL7, PH05. and PGK pro- 
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moters and measured the expression of a p-galactosidase/poliovirus VP2 protein fusion. CYC1 resulted in an expression 
level of 0.14% of the total cell protein (TCP); GAPDH, 0.22% TCP; PH05. 0.26% TCP; PGK, 0.9% TCP; and GAL7, 

0.96% TOR 

Using our GALGAP hybrid promoter, we have obtained expression levels of hemoglobin that represents si 5% of 
the total cell protein This most probably represents an increased promoter strength along with a highly stable protein 
product and perhaps mRNA with an extended half life. 

The use off a GAL-GAP hybrid promoter is preferred. Both elements (the GAL^as GAP transcnptional 

initiation site) are well understood. Studies on the mechanisms of transcriptional regulation off the GAL regulon have 
been fairly extensive. The galactose regulon includes five genes that encode enzymes required for the utilization of 
galactose. Four off these genes (GAL1, GAL7, GAL10, and GAL2) are expressed only in the presence of galactose. 
Galactose induction does not occur in the presence of glucose unless the yeast strain bears a mutation in the REGI 
gene. The GAL1 , 7, 10 and 2 genes are regulated by at least two other genes, GAL80 and GAL4. The GAL4 gene is a 
transcriptional activator protein that activates mRNA synthesis from the GAL1 , 7, 10 and 2 upstream activator sequences 
(UASqal)- Although GAL4 is constitutively expressed, it is functionally silent in the absence of galactose. Repression 
of GAL4 activity, in the absence of galactose is maintained by the product of the GAL80 gene. The GAL80 protein 
apparently interacts physically with GAL4 to prevent transcriptional activation. Presumably galactose or a galactose 
derivative prevents this interaction to allow GAL4 mediated induction. 

Haploid strains of S.cerevisiae have three different genes encoding the enzyme glyceraldehyde-3-phosphate dehy- 
drogenase (GAP). These genes have been designated TDH1, TDH2 and TDH3 and each is present as a single copy 
per haploid genome. The TDH3 gene produces approximately 60% of the cell's GAP enzyme and TDH1 and 2 produce 
about 12% and 28%. respectively (McAllister, L and M.J. Holland. 1985. J. Biol Chem. 260: 15019-15027). Holland's 
group (Holland et al. 1981. J. Biol Chem, 2^ 1385-1395; and Holland et al. 1983. J Biol Chem 2^:5291-5299) has 
cloned and characterized the three GAP genes of S.cerevisiae. The clones have been designated pGAPl 1 , pGAP63. 
and pGAP491 . pGAP491 corresponds to the TDH3 gene and is therefore, the most highly expressed. This promoter 
has been used to express a wide variety of proteins in yeast including: 



Protein 


REF 


a Interferon 


(1) 


Hepatitis B antigen 


(1) 


Thaumatin 


(2) 


Hepatitis B antigen 


(3) 


HIV III reverse transcriptase 


(4) 


human SOC 


(5) 


al antiprotease 


(6) 


(1) Bitter. G. and K.M. Eagan. 1984. Gene. 22.263-274. 

(2) Edens, L. et al. 1984. Cell, SZ:629-633. 

(3) Kitano, K. etal. 1987 Biotechnology 5:281-283. 

(4) Hallewell. R.A. et al. 1987. Biotechnology 5:363- 
366. (5) Barr. PJ. et al. 1987 Biotechnology 5:486-489. 
(6) Travis, J. et al. 1965. J Biol Chem 2ga:4384-4389. 



This promoter is commonly used as a 600-850bp fragment and is essentially un-regulated. In its long form this is a 
very powerful promoter. The form we are using consists of only ~2Q0bp 5' of the translational initiation site. This form, 
with no added enhancer sequences is substantially less active than the longer form of the promoter (Edens, L. et al. 
Cell, 27:629 (1984)). Our addition of the GAL enhancer region confers both regulation and high levels of expression. 
With only the GAP491 promoter, alpha and beta globin were produced at a level of less than 0.2% total cell protein; with 
the GAL-GAP491 hybrid promoter, expression jumped to 7-10% total cell protein. 

Several other hybrid promoters are of particular interest; 
GAL-SIGMA 

A strong galactose- regulated promoter with the sigma transcriptional start site. 
SIGMA-GAP 

A strong peptide hormone-regulated promoter with the GAP491 transcriptional start site. 
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GAL-EFIII 

A strong galactose-regulated pronnoter with the elongation factor III transcriptional start site. 
SIGMA-EFIII 

A strong peptide hormone-regulated promoter with the elongation factor III transcriptional start site. 

5 One could easily conceive of other promoter systems that would also work. This would include, but not be limited 
to, a variety of constitutive promoters. For example, the yeast mating factora (MFcz) promoter or the mating factor a 
promoter MF{a), the phosphoglycerate kinase promoter (PGK), hexokinasel. hexokinase2, glucokinase, pyruvate 
kinase, triose phosphate isomerase, phosphoglycerate isomerase, phosphoglycerate mutase, phosphofructose kinase 
or aldolase promoters may all be used. In short, any well expressed yeast promoter may work for expression of hemo- 

10 gjobin in yeast. A wide variety of naturally occurring, regulated promoters could also be used, for example: GALl-10. 
GAL7. PH05, ADHII have all been used to produce heterologous proteins in yeast. A variety of synthetic or semi-synthetic 
yeast promoters could also be employed such as GAL-PGK. GAL-MFa-1, GAL-MFa1, GAL-SIGMA. ADHII regulatory 
sequences could also be coupled to strong transcriptional initiation sites derived from a variety of promoters. The PH05 
regulatory sequence or the sigma element regulatory sequences could also be used to construct powerful hybrid pro- 

15 moters In addition to yeast promoters, it is conceivable that one could use a powerful prokaryotic promoter like the T7 
promoter. In this case, one could place the T7 polymerase under the control of a tightly regulated yeast promoter. Induc- 
tion of the phage polymerase In yeast cells bearing hemoglobin genes under T7 promoter regulation would allow tran- 
scription of the genes by this very efficient phage polymerase, 

Because most of the yeast regulatory sequences described above serve as targets for proteins that are positive 

20 regulators of transcription, It is conceivable that these proteins may limit transcription in situations where the target 
sequence is present in many copies. Such a situation may obtain with vectors such as pCI B, pCIT, pCI U or pCl N which 
may be presem in excess of 200 copies per cell. Over-expression of the positive regulator (for example GAL4) may result 
in enhanced expression. It is possible to construct a strain in which the GAL4 gene is altered to remove its promoter 
and the promoter replaced with the GAL7 or GAL1 -1 0 promoters, both of which are transcribed more efficiently than the 

25 GAL4 promoter. In this situation, the positive transcriptional activator protein GAL4 would be expressed at elevated level 
at the time hemoglobin expression was induced. 

The consensus sequence for higher eukaryotic ribosome binding sites has been defined by Kozack (Ceil, 44:283- 
292 (1986)) to be: G'^qCGAUGG. Deviations from this sequences, particularly at the -3 position (A or G). have a large 
effect on translation of a particular mRNA. Virtually all highly expressed mammalian genes use this sequence. Highly 

30 expressed yeast mRNAs, on the other hand , differ from this sequence and instead use the sequence AAAAAUOU (Cigan 
and Donahue. Gene. 59:1-18 (1987)). The ribosome binding site that we use for expression of the a and p-globins 
corresponds to the higher eukaryotic ribosome binding site. It is within the contemplation of this invention to systematically 
alter this RES to test the effects of changes that make it more closely resemble the RBS of yeast. It should be pointed 
out, however, that alterations at the -2, - 1 and +3 positions, in general, have been found to only slightly affect translational 

35 efficiency in yeast and in mammals. 

Intracellular expression of genes in S. cerevisiae is primarily affected by the strength of the promoter associated 
with the gene, the plasmid copy number (for plasmid-borne genes), the transcription terminator, the host strain, and the 
codon preference pattern of the gene. When secretion of the gene product is desired, the secretion leader sequence 
becomes significant. It should be noted that with multicopy plasmids. secretion efficiency may be reduced by strong 

40 promoter constructions. Ernst. DNA 5:483-491 (1986). 

A variety of extrachromosomally replicating vectors (plasmids) are available for transforming yeast cells. The most 
useful multicopy extrachromosomal yeast vectors are shuttle vectors that use a full length 2fi -circle combined with an 
E coli plasmid. These vectors carry genes that allows one to maintain the plasmid in appropriate yeast mutants and 
antibiotic resistance markers that allow selection in E coH. Use of the full-length 2jyi -circle, in contrast to vectors containing 

45 only a partial 2\l sequence, generally results in much higher plasmid stability, particularly in yeast strains that have been 
cured of endogenous 2\l plasmid. The pC series of vectors described herein are vectors of this type. 

Strains could also be constructed in such a way that the GALGAP hemoglobin expression cassettes were integrated 
into chromosomes by using yeast integrating vectors. Although the copy number of the hemoglobin genes would be 
lower than for plasmid vectors, they would be quite stable and perhaps not require selection to be maintained in the host 

so cell . Yeast i ntegrating vectors include Yips (Struhl, et al , PNAS, 76: 1 035-39. 1 989). Yip 1 (Jd ). and pGT6 (Tchumper and 
Carbon. Gene, 10:157-166, 1980). For information on these and other yeast vectors, see Pouwels. et al.. Cloninc Vector. 
Vl-I,slssa. (Elsevier, 1985). 

The alpha arxl beta globin genes may be introduced by separate plasmids, or both upon the same plasmid. The 
advantage of a single plasmid system over a double plasmid system is theoretical. It is generally thought that there is 
55 an upper limit to the total number of plasmid copies per cell. If it is 1000, for example, the two plasmid system could 
have only 500 copies of a-chain plasmid and 500 of the p-chain plasmid. A single plasmid of 1000 copies per cell would 
bear 1000 copies of each a- and p-chain gene. The number of copies may be irrelevant, however, if other factors are 
limiting, in fact, several groups favor using strains that contain genes integrated into various chromosomal loci. Such 
strains very stably maintain the foreign gene and do not require special media to maintain selection for the plasmid. 
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Highly expressed yeast genes show a very high codon bias. The genes encoding glyceraldehyde-3-phosphate dehy- 
drogenase and ADH-I, for example, show a 90% bias for a set of 25 codons. Highly expressed yeast genes (>1% of the 
total mRNA) have yeast codon bias indices of >.90. Moderately expressed genes {0 1- 05% of the total mRNA) have 
bias indices of 0.6-0.8. and genes expressed at low levels (>0.05% of the total cell protein) have a codon bias off 0.10- 

5 0.50 (Bennetzen and Hall. J. Biol. Chem., 257:3026-3031 (1982)). The calculated value for the codons of the human a- 
globin cDNA is 0.23. A similar value can be calculated for the p-globin cDNA. Because there is a very high correlation 
between the most commonly used codons, it is possible that hemoglobin expression from the human cDNA in yeast 
may be limited by the availability of the appropriate tRNA molecules. If this is so, a complete synthesis of the gene using 
the most highly favored yeast codons could improve the expression levels. It is quite possible that the greatest negative 

10 effect of adverse codon use would be if there was an abundance of codons used in the cDNA that are represented by 
low abundance tRNAs. In such a case, high level expression of hemoglobin could completely drain that pool of tRNA 
molecules, reducing translation not only of hemoglobin but of yeast proteins that happen to use that codon as well. In 
the case of the a-globin human cDNA, the most commonly used leucine codon is CTG (14 of 21), this codon is never 
used in highly expressed yeast genes (Guthrie and Abelson. The Molecular Biology of the Yeast Saccharomyces. Eds. 

75 Stratern, Jones and Broach. 1 982. Cold Spring Harbor. NY). The low codon bias index and the presence of rare yeast 
codons in the globin cDNAs have been sufficient incentive for us to synthesize a modified form of the alpha- and beta- 
globin genes using the preferred yeast codons. 

"Di-Alpha" and Pi-Beta" Hemoglobins 

20 

The present invention further contemplates in some emtKXiiments the combination of (a) one molecule of a di-alpha 
globin-like polypeptide with two molecules of a beta globin-like polypeptide to form a "di-alpha" hemoglobin-like protein; 
(b) two molecules of an alpha-globin-like polypeptide with one molecule of a di-beta globin-like polypeptide to form a 
"di-beta" hemoglobin-like protein; or (c) one molecule of a dl-alpha globin-like polypeptide with one molecule of a di- 
25 beta globin-like polypeptide to form a "di-alpha/di-beta" hemoglobin-like protein. 

It should further be noted that the delta, gamma and epsilon chains have considerable homology with the beta chain 
and that the zeta chain has considerable homology with the alpha chain. Di-delta, di -gamma, di-epsilon add di-zeta 
polyp^tides are therefore witiiin the compass of the invention and may be used In the preparation of novel hemoglobins 
of types other than Hgb A1 . 

30 Preferably the di-alpha linker (if one is used) consists of 1 to 3 amino acids which may be the same or different. A 
Mono-Gly linker is especially preferred. In designing such a linker, it is important to recognize that it is desirable to use 
one or more amino acids that will flexibly connect the two subunits, transfornfiing them into domains of a single di-alpha 
globin polypeptide. 

The preparation of "di-beta" mutants is also contemplated. The distance between the N-terminus of one beta subunit 
35 and the C-termlnus of the other is 18.4 A in the deoxy configuration and 5.2 A in the oxy form. Preferably, the di-beta 
linker consists of 4 to 9 amino acids which may be the same or different. 

The linker should be unlikely to form a secondary structure, such as an alpha helix or a beta-sheet. Certain amino 
acids have a greater tendency to participate in such structures. See Chou and Fasman, Biochemistry. 1 3:222-245 (1 974), 
incorporated by reference. The amino acids are ranked in order of decreasing participation below. The preferred linker 
40 amino acids are boldfaced. Glycine is the most suitable amino acid for this purpose. The most preferred di-alpha linkers 
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Alpha Helix Formers 


Beta Sheet Formers 


Glu(1 53) 


Met (1.67) 


Ala (1.45) 


Val (1.65) 


Leu (1.34) Ha 


lie (1.60) H6 


His (1.24) 


Cys{1.30) 


Met (1.20) 


Tyr(1.29) 


Gin (1.17) 


Phe(1.28) 


Val (1.14) 


Gin (1.23) 


Trp (1.14) 


Leu (1.22) 


Phe(1.12) ha 


Thr (1.20) 


Lysfl.07) 


Tro f1.19)h6 


lie (1.00) 


Ala (0.97) ip 


Asp (0.98) 


Arg (0.90) 


Thr (0.82) 


Qiy(0.81) 


Arg(0.79) 


Asp (0.80) ip 


Ser (0.79) 


Lys (0.74) 


Cvs fO.77) la 


Ser (0.72) 


Ml (0.73) 


His (0.71) 


Tyr(0.61)ba 


Asn (0.65) 


Pro (0.59) 


Pro fO.62) bp 


Oly (0.53) Bfl 


Glu (0.26) Bf> 


(The letter symbols are Ha. strong a former; ha, a former; la; weak a former; ia, a irKiifferent; ba, a breaker; 
and Ba strong a breaker. The p symbols are analogous. Trp is bp if near the C-terminal of a p-sheel region.) 



The alpha helix of a polypeptide chain comprises an average of 3.6 residues per turn. In globular proteins, the 
average length is about 17A, corresponding to 1 1 residues or 3 helix turns. In alpha and beta globin, the helices range 
in length from 7 to 21 amino acids (A.A). The beta pleated sheet comprises 2.3 residues per turn; the average length is 
about 20A or 6 residues. 

Chou and Fasman define an alpha helix nucleus as a hexapeptide containing four helix forming residues and not 
more than one helix breaker, and a beta sheet nucleus as a pentapeptide containing three beta sheet forming residues 
and not more than one sheet breaker. 

The amino acid sequence in the vicinity of the di -alpha linker is as follows: 



residue # 


138 


139 


140 


141 


1 


2 


3 


4 


AA 


Ser 


Lys 


Tyr 


Arg - (XXX) ^ 


Val 


Leu 


Ser 


Pro 


Helix Not 


H21 


HCl 


HC2 


HC3 


NAl 


NA2 


Al 


A2 


Helix Pot 


079 


107 


061 


079 


114 


134 


079 


059 


Sheet Pot 


072 


074 


129 


090 


165 


122 


072 


062 
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(Note: Helix- and sheet forming potentials have been 
multiplied by 100 for typographical reasons.) 



The di-alpha linker is preferably only 1-3 amino acids. Thus, it can form an alpha helix only in conjunction with the 
linker "termini". A one or two residue linker, even if conposed of amino acids with strong secondary structure propensities, 
would be unlikely to assume an alpha helix or beta sheet configuration in view of the disruptive effect of. e.g., Arg 141 
or Ser 3. tf the linker is 3 residues long, it would be preferable that no more than one residue be a strong alpha helix 
former, unless the linker also included a strong alpha helix breaker. 

The amino acid sequence in the vicinity of the di-beta linker may Impose more stringent constraints. 



143 


144 


145 


146 


1 


2 


3 


4 


His 


Lys 


Tyr 


His - 


(XXX) - Val 


His 


Leu 


Thr 


H21 


HCl 


HC2 


HC3 


NAl 


KA2 


NA3 


Al 


124 


107 


061 


124 


114 


124 


134 


082 


071 


074 


129 


071 


165 


071 


122 


120 



The di-beta linker is longer (preferably 4-9 A. A.) and therefore more susceptible to secondary structure formation. 
It is desirable that the amino acid adjacent to Val-1 be an alpha helix breaker in view of alpha-helix propensities of Val- 
His-Leu. More generally, it is desirable that the linker not contain (or cooperate with the proximately linked amino acids 
to form) an alpha helix nucleus or beta sheet nucleus. 

Amino acids with a high propensity toward alpha helix formation may be used in the linker If accompanied by "helix 
breaking" amino acids. Similarly. Beta sheet formation may be prevented by "sheet disrupting" amino acids. 

Of course, prediction of secondary structure using Chou and Fasman's approach has its limitations and the ultimate 
test of the acceptability of a linker is whether or not the di -alpha or di-beta hemoglobin has the desired affinity for oxygen. 
In particular, a poly-alanine linker, despite its supposed propensity to alpha-helix formation, may welt be of value since 
the alanine group is compact and therefore the linker shouW be quite flexible if secondary structure does not form. 

In an especially preferred embodiment, di-alpha and beta globin genes are combined Into a single polycistronic 
operon. The use of a polycistronic operon is not, however, necessary to practice the present Invention, and the alpha 
(or di-alpha) and beta (or di-beta) gtobin genes may be expressed from separate promoters which may be the same or 
different. 

While the preferred "genetically fused hemoglobin" of the present invention is one comprising a di-alpha and/or di- 
beta globin, other globin chains may be genetically fused and used in the production of hemoglobins of species other 
than Hgb Al (a2p2)- 

The appended claims are hereby incorporated by reference as a further enumeration of the preferred embodiments. 
All cited references are incorporated by reference to the extent necessary to enable the practice of the invention as now 
or hereafter claimed. 

By the means suggested in this specification, we have had the foilcwing acNevements: 

(1) Expression of Met-FX-alpha globin in E. coli from a dicistronic operon comprising a introductory cistron and a 
Met-FX-alpha globin gene, both transcribed from a Tac promoter. (Example 2) 

(2) Expression of Met-FX beta globin in E, coli by similar means. (Example 2) 

(3) Co-expression of Met-FX-alpha globin and Met-FX-beta globin in E. coli from a tetracistronic operon comprising 
an introductory cistron. an FX-aipha globin gene, an intercistronic sequence, a second Introductory cistron, and an 
FX-beta globin gene, all controlled by a single Tac promoter. The alpha and beta globins were intracellularly assem- 
bled into functional FX hemoglobin. The FX Hgb was enzymatically converted to Hgb. (Example 3) 

(4) Co-expression as in (3) above of mutants of FX-hemoglobln in which the beta globin subunits possessed the 
Beth Israel, Cheverly, Providence/MSR, Kansas or beta67 Val lie mutations. (Example 4) 

(5) Co-expression as in (3) above of Met-alpha globin and Met-beta globin and intracellular assembly into Met-Hgb 
(a.k.a. Des-FX-Hgb). (Example 5) 
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(6) Co-expression as in (3) above of Des-Va!-alpha globin and Des-Val-beta globin. (Example 6) 

(7) Co-expression as in (6) above, but with the Des-Val-beta globin gene preceeding the Des-Val-alpha globin gene 
within the operon. (Example 6) 

(8) Co-expression as in (3) above of (Des-Val)-alpha-(Gly-Gly) -alpha globin and beta-globin and intracellular assem- 
bly into a di-alpha Hgb. (Example 8) 

(9) Co-expression as in (8) above of (Des-Val-alpha-(GlyGly)-alpha globin and a mutant (Nagai. Arg-Nagai, or Kan- 
sas) of beta globin and intracellular assembly into a mutant di-alpha Hgb. (Exarrple 9) 

(10) Co-expresion as in (3) above of Des-Val-alpha globin and a Des-Val beta-globin with the Presbyterian mutation 
and intracellular assembly into a mutant Des-Val Hgb. (Example 1 1). 

(11) Co-expression as in (8) above of a (Des-Val)-alpha-(GlyGly)-alpha globin and of a betal globin with the Pres- 
byterian mutation. (Example 11). 

(12) Expression of dl-alpha globin and beta-globin from separate promoters on the came plasmid (Example 12). 

(13) Devised Hypothetical Protocol for Co-expression of di-alpha globin and two copies of beta globin from same 

operon (Example 13). 

(14) Devised Hypothetical Protocol for Co-produciton of Di-beta Hgb (Example 14). 

(15) Devised Hypothetical Protocol for co-expression of alpha globin and beta-globin under control of separate 
promoters on the same plasmid (Example 16). 

(16) Devised Hypothetical Protocol for co-expression of alpha and beta-globin under control of different promoters 
on different plasmids (Example 1 7). 

(17) Co-expression of a and p globin in S. cerevisiae (Example 19). 

(18) Co-expression of di-a globin and p-globin in S. cerevisiae . (Example 23). 

(19) Construction of Des-Val-alpha globin arxJ Des-Val-beta globin in E. coli under lambda Pl control. 

(20) Co-Expression of di-alpha globin and Des-Val-bela globin in E. coli under lambda Pl control. 

(21) Co-Expression of alpha and beta globin from separate plasmids in diploid strains of S. cerevisiae . 

(22) Co-Expression of di-alpha globin and beta globin Presbyterian mutant in S. cerevisiae . 

(23) Preparation of vectors for expression of di-alpha globins with -Gly- or -Pro- linkers. 

(24) Evaluation of different strains and induction temperatures for expression of di-alpha Hgb in E 

(25) Co-Expression of alpha and mutant beta globin in S. cerevisiae . assembling to form low-affinity Hgb mutants. 

An unexpected and surprising change in oxygen binding characteristics of hemoglobin was observed upon replace- 
ment of the N-terminal valine with methionine. As illustrated in Example 11, hemoglobin A^ purified from Wood has a 
P50 value of 4.03 with N»2.7. DesFX-hgb produced in E coli, a hemoglobin identical to A^ except for the addition of a 
methionine at the N-termini of the alpha and beta chains, has essentially the same P50 and N values. (Example 7). Thus, 
the addition of a methionine, without altering the adjacent valine residue, has little or no effect on oxygen binding. On 
the other hand, a two-fbid higher P50 value, 7.04, was observed for desVal-hgb produced in E. coli . a hemoglobin in 
which the normal N-terminal valine of each chain was replaced with methionine. Cooperatlvlty, as measured by N, was 
the same, however, for this molecule as for the others. A similar comparison was made for two hemoglobins each con- 
taining fused alpha chains and each containing the Presbyterian mutation, one produced in E. coli (Example 11) and 
one in yeast. The E. coli hemoglobin was constructed with a Des-Val alpha chain, i.e.. the N-terminus had the normal 
valine replaced with methionine. Oxygen binding was characterized by P50=33 and N=2.4 (Example 11). The corre- 
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spending yeast coding region begins with an additional methionine codon in front of the normal valine codon. Because 
this initial methionine is removed post translationally in vivo, the purified hemoglobin has a normal N-terminal valine. 
For this molecule, P5o=23 to 25 and N=:2.5. Thus, in two quite different cases, the replacement of an N-terminal valine 
with an N-terminal methionine increased the P50 value. Under physiological conditions, it is expected that the fused 
Presbyterian hemoglobin produced in E coli will deliver 20-30% more oxygen than the similar hemoglobin, with its altered 
N-terminus, produced in yeast. 

A very large number of different plasmids are referred to in the Examples which follow. In order to highlight the 
relationships among these plasmids, a Table of vectors has been compiled (See Table 100). 

Example 1: Construction of FX-alpha Globin fpDLII-62m^ and BetaGlobln (pDLII-lOa) Expressi on Vectors 

Materials and Methods 

Unless otherwise stated all electroelutions. phenol/chloroform extractions, ethanol (EtOH) precipitations, alkaline- 
SDS plasmid purifications, agarose electrophoresis and DNA manipulations were carried out essentially as described 
by Maniatis et a! ("Molecular Cloning" Cold Spring Harbor. New York, 1982). 

The following abbreviations and definitions are used: ethylenediaminetetraacetic acid (EDTA); sodium dodecylsul- 
fate (SDS); polyacrylamlde gel electrophoresis (PAGE); dimethylsulfoxide (DfwISO); dithiothreitol (DTT): isopropyl-beta- 
D-thiogalactopyranoside (IPTG); 2xYT medium (16g bacto tryptone, lOg Bacto yeast extract. 5g NaCI per liter water); 
SDS-PAGE loading buffer (0.125MTris-HCI. pH6.8, 20% v/v glycerol. 2% SDS. 2%2-mercaptoethanol. 0.01%bromphe- 
nol blue); phosphate buffered IB medium (24g yeast extract, 12g tryptone. 4 mL glycerol. 17mL 1M KH2PO4, 72 mL 
1M K2HPO4 per liter water); Tris-EDTA buffer (TE: lOmMTris. pHS.O. IrnM EDTA); Tris-borate, EDTA buffer (TBE: 0.089 
MTris, 0.089 M boric acid, 0.002 M EDTA); Tris-acetate EDTA buffer (TAE: 0.04MTris. 0.04 M acetic acid, 0.001 M EDTA). 

Protein electrophoresis was performed by the method of Laemmli. U.K. (Nature 1970. 227 . 680-685) on 1 5% SDS- 
polyacrylamide gels. 

E.coll JM109 cells were nnade competent as follows: Two hundred milliliters of 2xYT medium was inoculated with 
2mL of an E.coll JM1 09 overnight culture. The 200mL culture was then Incubated with shaking at 37^0 for 1 hour. Cell 
were harvested by centrlfugation at 6000 rpm at 4'C for 4 min In a Beckman JS13.1 swinging txicket rotor (Beckrran 
Instruments, Inc., Palo Alto, CA). The cells were resuspended in 50mL total of a buffer containing 45mM MnCI2, 60 mM 
CaClz. 40niM KOAc, pH 6.2, 15% sucrose (w/v) 1 .3% RbCI (w^), and 7.5% (vA/) glycerol. Following centrifugation as 
eibove, the cells were resuspended In 20mL of the same buffer and Incubated at 0°C for 30 minutes. Cells were dispensed 
in one milliliter aliquots and stored at -80''C until used. 

Unless othenwise stated, all restriction enzyme digests were performed under conditions suggested by the manu- 
facturer. DNA concentrations wore determined by Beer's law using measured absorbance at 260 nm (A260) against a 
water reference standard. For synthetic oligonucleotides the following extinctions were used: E260=o.05mLyug/cm and 
for double stranded DNA E260=o.02 mUug/cm. 

Globin Gene Synthesis 

Each globin gene was constructed from 14 separate oligonucleotides ranging in length from 50-85 base pairs. The 
FX-alpha globin gene was synthesized from oligonucleotides SJH l-33a-f, SJH l-34a-f and SJH l-35a,b; FX-beta globin 
gene was synthesized from SJH l-36a-f. SJH l-37a-f arxl SJH l-38a,b. Each globin gene is proceeded by a short loader 
gene as previously described. Oligonucleotides were synthesized on a Biosearch 8600 instrument using beta-cya- 
noethylphosphoramidite chemistry on 1,000 angstrom CPG columns (0.2 mmole) (Beaucage, S.L and Caruthers. M.H. 
Tet. Lett. 1981, 22, 1859-1862). The sequence of these oligonucleotides is given in Figure 4. 

Unless othenwise stated, the oligonucleotides were cleaved from the columns using the following protocol: Approx- 
imately 0.5mL of fresh, concentrated NH4 OH was drawn into the column with a 1 mL syringe. The NH4 OH was allowed 
to react for 20min then expressed into a glass vial (approximately 4mL capacity). This process was repeated 2 times, 
the vial was filled to greater than 75% capacity with NH4OH. and heated at 55*C. overnight. The samples were lyophllized. 
resuspended In 0.1 mL H2 0 and the concentration estimated by measuring A26o- 

Two hundred micrograms of the Individual oligonucleotides were purified by urea polyacrylamlde gel electrophoresis. 
To do this, an equal volume of 2x loading buffer (90% formamide (v/v), O.SxTBE, 0.05% (wA^) bromophenol blue, 0.05% 
{wN) xylene cyanol) was added to the oligonucleotide. The sample was heated at 95''C for 1 0 min. and applied to a 1 0% 
acrylamide gel containing 7M urea and 1XTBE. Electrophoresis was at 800 voKs for approximately 2 hrs. The full length 
oligonucieotide was visualized under ultraviolet light. That region of the gel was then excised and incubated In 3mL of 
lOOmM Tris. pH 7.8, SOOmM NaCI, 5qiM EDTA buffer at 60«C, overnight. 

The oligonucleotide solution was further purified by reverse phase chromatography as follows: A CI 8 Sep-Pak 
cartridge (Waters Associates) was washed with lOmL 100% methanol followed by lOmL of H2O. The oligonucleotide 
solution was applied to the column, washed with 20mL H2O and eluted with 3x1 mL aliquots of 50mM triethylammonium 
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acetate, pH 7.3/methanol (1:1). The purified oligonucleotide was iyophilized, washed with 100% ethanol, dried, and 
resuspended in 0.1 mL H2O. The concentration was determined by A26O 

The synthetics FX-beta gene sequence (included in Figure 5) was constructed as follows: lOOpmole of the following 
oiigo nucleotides were kinased in 3 separate reactions. Reaction 1 contained oligonucleotides SJH l-36b, c, d, e, and f. 
Reaction 2 contained SJH l-37a, b, c. and e. Reaction 3 contained SJH l-37d, f, and SJH l-38a. After combining the 
appropriate oligonucleotides, the solutions were Iyophilized to dryness and resuspended in 16uL of H2O. Two uL of lOx 
kinase buffer (0.5M Tris-HCI, pH7.4. O.IM MgCl2). 0.5 uL of 1 0OmM DTT. and luL of I.OrnM ATP were then added. The 
reaction was initiated by addition of luL (2U) of T4 polynucleotide kinase (IBI, Inc., New Haven, CT). After Incubation 
at 37°C for 1 hour, the reactions were heated to 95°C for 10 minutes to inactivate the kinase. The three reactions were 
combined and lOOpmoles of oligonucleotides SJH l-36a and SJH l-38b were added. After addition of lOuL of lOOmM 
Tris, pH 7.8, lOOmM MgCl2, the oligonucleotides were allowed to anneal by incubating at 65°C for 30 min, 37°C for 
30mtn. and IS^C for 1 hour. Annealed oligonucleotides were ligated by addition of ATP {ImM, final) and DTT (10 mM 
final) and4uL(20U) T4 DNA ligase (IBI, Inc., New Haven, CT) and incubation at 15°Cfor 1 hour. Aliquots of this ligation 
mixture were then cloned directly into Ml3mp19 (see below). 

Oligonucleotides for the construction of FX-alpha globin were similarly purified, kinased, annealed, and ligated. 
Before ligation intoM13mp19, the full length FX-alpha globin gene was purified by electrophoresis through 0.8% agarose 
in 1XTAE buffer and electroeluted into dialysis tubing using Q.5X TBE as the electroelution buffer by the method of 
Maniatis et al. Eluted DNA was phenol extracted. EtOH precipitated, and resuspended in 20uL TE buffer Aliquots were 
used for cloning into M13mp19. 

Ph a g e Vectors 

For cloning, a 2, 5 or 10 fold molar excess of the individual FX-alpha and FX-beta globin gene sequences were 
combined with 200ng of double cut, gel purified M13mp19-RF (New England BioLalis. Inc., Beverly. MD) and ligated 
overnight at IS^'C in 50uL ligation buffer (IBI. Inc.. New Haven. CT) containing 2U of T4 ligase. FX-Alpha globin was 
cloned into the Xmal/PstI sites of M13mp19. FX-Beta globin was cloned into the Pst l /Hind lll sites of M13mp19. 

E.coli JM109 was transformed with the M13mp19 ligation mixture containing the FX-alpha or FX-beta globin gene 
sequences using the following transformation protocol: Nine microliters of DMSO was added to 0.25mL of competent 
Ecoli JM1 09 and the cells were incubated on ice for lOmin. Aliquots of the FX-alpha or FX-beta globin ligation reactions 
were added and incubated on ice for 40 minutes and at 42°C for 3 min. One hundred microliters of a JM109 overnight 
culture was added to each transformation mix followed by SOfiL of a solution containing 50^L of 2% (wA^) 5-bromo-4- 
chloro-3-indolyl galactopyranoside in dimethylfbrmamide and 10^L of lOOrnM isopropylthiogalactoside. Molten B-top 
agar (lOg Bacto tryptone, 8g NaCI, 6g agar per liter), 2.5mL was added and the mixture poured onto a B-tx)ttom agar 
plate (1 Og Bacto tryptone, 8g NaCI, 1 2g of agar per liter). Following incubation overnight at 37°C colorless plaques (i.e., 
clones containing inserted DNA) were removed from the plates using sterile transfer pipettes and inoculated into ImL 
of a JM109 overnight culture diluted 1 :100 in 2xYT media. 

Bacteriophage clones were grown at 37'*C for 6 to 8 hrs. and centrifuged in a microcentrifuge for 5 minutes. The 
cell pellets were processed for M13mp19 RT using the alkaline-SDS method and resuspended in 20[iL of TE buffer 
containing 20M,g/mL DNAasefree RNAase (Sigma, St. Louis, MO ). Aliquots (1 -3^1) of the RF preparations were digested 
with 10-20 units each of the appropriate restriction enzymes (see above) and analyzed on 0.8% agarose electrophoresis 
gels (see above). 

M13mp1 9 clones containing the correct size inserts were grown in larger quantities to obtain single stranded phage 
DNA for sequencing. Thirty-five milliliters of 2xYT medium was inoculated with 0.3mL of a JM1 09 overnight culture. After 
growth for 1 hour at 37°C, the 35mL culture was inoculated with 200|llL of the appropriate phage-containing supernatant. 
The culture was incubated for 6 hours at 37''C and the culture supernatant collected by centrifugation at 9000 rpm for 
1 0 min in a JS13. 1 rotor. The phage ware precipitated from 31 mL of supernatant by addition of 5mL 4M NaCI and 4mL 
40% (w/w) polyethylene glycol 6000 and recentrifuged as above. The phage pellet was resuspended in 0.4mL of TE. 
extracted with phenol/chlorofornVisoamyl alcohol (50:49:1 (v/v)) three times, chlorofbrm/isoamyl alcohol (49:1 (v/v))one 
time, and ethanol precipitated. The DNA pellet was resuspended In 20^L TE and quantitated spectrophotometrically by 
Azeo- One microgram of phage DNA was used per set of sequencing reactions. 

Sequencing was by the dideoxy method of Sanger (Sanger, F.S. fit ai- Proc. Nat. Acad. Sci.. USA 1977, 74. 5463- 
5467) with M13 -20 and Ml 3 -40 universal primers (New England BloLabs, Beverly, MA) and gene specific primers (for 
FX-alpha globin the following primers were used: alpha- 1 5'-CGTATGTTCCTGTCTTT-3'; alpha-2 5-ACAAACTGCGT- 
GTTGAT-3': for FX-beta globin the following primers were used: beta-1 5'-GCTGGTTGTTTACCCGT-3*; beta-2 5'- 
ACCCGGAAAACTTGCGTC-3'). 



23 



EP 0 700 997 A1 



Expression Vectors pDL ll-62m and pDL 11-1 Oa 

The appropriate sequences were excised from the M1 3mp1 9 vector and cloned into the pKK-223-3 (Pharnnacia/LKB, 
Piscataway, NJ)) expression vector under control of the Tac promoter (Brosius. J. and Holy, A. Proc. Nat. Acad. Sci.. 
USA 1984. 81, 6929-6933). A DNA sequence encoding alpha globin was removed by cutting with EcoRI and Pstl. The 
globin containing fragment was gel purified and ligated into EcoRI and Pstl double cut, gel purified pKK-223-3 using 
methods described above. E.coli JM109 cells were transformed with the ligation reaction containing the desired FX- 
alpha globin sequence (pDL ll-62m) (Figure 1) and selected for by growth on 2xYT media containing ampiciilin (100 
ng/mL). Individual clones were screened for the presence of the desired insert (yielding pDL ll-62m) by alkaline-SDS 
purification of plasmids and restriction analysis with the enzymes EcqRI and Pstl using the methods described above. 
Cloning of the FX-beta globin sequence into the pKK-223-3 expression vector, yielding pDL ll-10a (Figure 1) was done 
analogously except that restriction analysis and cloning was done with Pstl and Hind lll. 

Example 2: Separate Expression of Synthetic FX-Aipha a nd FX-Bet a Qiobin 

To assess the expression of the individual FX-globtn gene products, E.coli JM1Q9 clones transformed with either 
pDL ll-62m or pDL ll-10a were inoculated into 2mL of TB media containing ampiciilin (100|yig/mL). The inoculum was 
grown at 37°C for 3-4 hours, then divided into two 1 mL aliquots. One of the allquots was induced by the addition of IPTG 
(IrnM. final) and grown for an additional 3-4 hours The cells were collected by centrifugation, resuspended in 0.5mL 
SDS-PAGE loading buffer and heated at SS'C for 1 0 minutes The total cell protein mixture was electrophoresed on 15% 
SDS-polyacrylamide gels using authentic hemoglobin as a molecular weight standard. 

Example 3 : Polycistronic Coexpression of FX-alpha and FX-beta Globin Gene Products from the same Qperon (PDLII- 
66a) Arxj Conversion of FX-Hemoglobln to Hemoglobin 

To achieve coexpression of FX-alpha and FX-beta globins from a single polycistronic operon, the FX-beta globin 
sequence frompDL 11-1 Oa was excised with Hind lll and Pstt. gel purified, and ligated into Pstl/b!indlllcutandgel purified 
pDL ll-62m. Ligation and transformation conditions were identical to those described above, f^te that each globin cistron 
was proceeded by an "Introduction ' cistron as previously desaibed. so that the entire Tac promoter driven operon had 
four cistrons. Clones were individually examined for the presence of both FX-alpha and f^-beta globin genes by digestion 
of plasmids with Eco RI and separation of fragments by electrophoresis through 0.8% agarose. Plasmids containing both 
genes (pDL ll-66a. Figure 1) produced a fragment of approximately 1 .Okb following EcqRI digestion. 

Clones containing pDL ll-66a were grown in 2mL of TB media containing ampiciilin (lOO^g/mL) for 4 hours at 37**. 
The culture was divided into two 1 mL aliquots, one of which was induced with 1 niM IPTG. Incubation was continued for 
4 hours. Total cell protein extracts for both the uninduced and induced clones were examined by SDS-PAGE electro- 
phoresis to confirm the coexpression of both FX-alpha and FX-beta glot>in. 

To determine if the coexpression of both gene products resulted in the formation of tetrameric FX-alpha g[obin2 FX- 
beta globin2 protein, the following experiments were performed. Two liters of TB medium containing ampiciilin (100ng/ml) 
was inoculated with 20mL of an overnight culture of an FX-alpha/FX-beta expressing E coll clone and grown to an 
optical density at 600nm (ODeoo) of 2,1 at 37^C. The culture was induced with IPTG (2.5mM final concentration) and 
grown to an ODgoo of 3.5. 

The cells (40gm) were collected by centrifugation at 10,000xg and suspended in 80mL of lysis buffer (SOmM Tris- 
HGI, pH 8.0, 25% sucrose, ImM EDTA). Ten milliliters of lysozyme solution (18mg/ml in lysis buffer) was added and the 
mixture incubated on ice for 30 min MgCb, MnCl2. and DNAse I (Sigma, St. Louis. MO) were added to final concentra- 
tions of l OmM . 1 mM and 1 0^g/mL, respectively The cells were incubated at room temperature for 1 hour and an equal 
volume of a solution containing 1% one percent deoxycholic acid, 1% Nonidet P-40, 20mM Tris-HGI pH 7.5. 2 mM EDTA 
was added to the lysate. 

Particulate material was removed by centrifugation at 1 0,000 x g for 1 0 mIn. The supernatant (-200mL) was bubbled 
with carbon monoxide for 5 min and dialyzed overnight against 4 liters of 10mM NaP04 buffer. pH 6.0. The cell-free 
extract was clarified by centrifugation at lO.OOOxg for 10 min. To the supernatant was added 20g DE-52 (Whatman, 
U.K.). The pH of the suspension was adjusted to 7 0 and the ion exchange resin was removed by centrifugation. The 
pH of the supernatant was readjusted to 6.0 and the supernant was loaded onto a CM-cellulose column (2.5 x 1 5cm} 
equilibrated in lOmM NaP04. pH 6 0 at 4*»C. The column was washed with two bed volumes of lOmM NaP04. pH 6.0 
followed by a linear gradient of lOmM NaP04. pH 6.9 to 20mM NaP04. pH 9.0 (400mL total volume). Fractions 36 to 
42 contained a red solution and were combined; an aliquot of this solution was scanned from 650nm to 400nm revealing 
a spectrum identical to that for cartx)xyhemoglobln (Figure 6). An aliquot of the same peak was analyzed by SDS-PAGE 
electrophoresis with hemoglobin as molecular weight standard and was found to contain two protein bands of approxi- 
mate MW 15.500 and 16,200. As expected, these bands migrated at a slightly slower rate than authentic henfX}globin 
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{alpha MWs15,100; beta MW= 15.850) presumably due to the five amino acid extension of the Factor X recognition 
sequence. On this basis the material was designated FX-hemoglobin. 

FX-Hemoglobin (2.0mg) was digested at room temperature for 2 hours in 3mL of 20mM HEPES, pH 7.4, 0. 1 M 
NaCI, 1QmM CaCl2 containing 2mg of trypsin (Sigma, St. Louis. MO.. 10.000 units/mg). SDS-/PAGE electrophoresis 
5 confirmed the conversion of FX-hemoglobin to material that comigrates with native hemoglobin. 

Example 3A : Distribution of FX-Globin Products from E.coli Expression 

One hundred milliliter cultures of E coH clones expressing Fx-alpha globin (plasmid pDL ll-62m), Fx-beta globin 
10 (plasmid pDL 11-1 Oa) and Fx-hemoglobin (plasmid pDL II -66a) were started with 1 ml inocula of overnight cultures. After 
growth for 4 hours, protein expression was induced by addition of IPTG to 1 mM final concentration. Incubation was 
continued for an additional 3 hours. The cells were collected by centrifugation, weighed, and lysed as described alx)ve 
except that DNAase treatment was done for 30 minutes on ice. The samples were centrifuged at SOOOxg. 10 min and 
the supernatants (representing the soluble protein fraction) brought to 2 ml final volume with H2O and frozen at -80°C. 
15 The pellets (representing the insoluble protein or inclusion body fraction) were washed twice in 5 mL 0.5% Thton X-10O, 
1 mM EDTA, resuspended in 2 ml H2O and frozen at -80*C 

Analysis of protein distribution was accomplished by SDS-PAGE and Western blotting. The primary antibody was 
rabbit anti-human hemoglobin IgG. The western blotting protocol was according to the manufacturer's recommendations 
(Proto Blot Western Blot AP system, Promega Corp., Madison, Wl). Samples of soluble and insoluble protein represent- 
ee ing 60 fig of wet cell weight were analyzed from the FX-alpha and FX-beta expressing clones Due to the greater level 
of expression in the FX-hemoglobin clone, material representing only 15 ng of wet cell weight was analyzed. 

As seen in Table 1 1 the distribution of the proteins varied. Fx-alpha globin was detectable only in the soluble fraction 
while FX-beta globin partitioned between the insoluble and soluble fractions of the cel.. FX-Hgb, which stabilizes each 
separate subunit, was found only in the soluble fraction, and at a concentration at least 2.6 times that of the individually 
25 expressed subunits. These results indicate that FX-beta globin is totally soluble only when allowed to assemble with FX- 
alpha globin. 

Example 4: Ck)nstruction And Expression of Mutant FX-Alpha/FX-Beta Tetrameric Hemoglobin Expression Vector 

30 Hemoglobin Beth Israel : pDL 1 1- 10a was digested with restriction enzymes Sad and Soe l. gel purified, and isolated 
by electro-elution. Oligonucleotides incorporating the appropriate codon change for Hemoglobin Beth Israel (beta^^ 
asn->5er) (Figure 7) were synthesized as previously described above to bridge the ^1 10 Qqq} restriction sites. Puri- 
fication and quantitation of the individual oligonucleotides was as previously described. The complementary oligonucle- 
otides were annealed by heating to 95^0 for 1 0 min. followed by slow cooling to room temperature over a 2 hour period. 

35 An aliquot of the annealed mixture was then combined with the Sac l /Spe l digested, gel purified pDL 11-1 Oa plasmid at 
molar ratios similar to those used for the initial FX-alpha and FX-beta cloning. T4 DNA Itgase (2U) was added and the 
mixture incubated for 1 hour at room temperature. E.coli JMI 09 was transformed with this ligation mixture as previously 
described. Individual clones were isolated, and plasmids purified were sequenced using primer beta-1 (vida supra) . 
Plasmid sequencing was done with a Sequenase kit (United States Biochemical Corp., Cleveland, OH) following the 

40 protocol supplied by the manufacturer. The appropriate mutated beta globin sequence was then excised with Hind lll 
and PstI, gel purified, and cloned into pDL ll-62m as described above. 

Other hemoolobin mutants : The synthetic genes encoding Hemoglobin Cheverly (beta^^ phe~>ser) Hemoglobin 
Providence/MSR (beta^^ lys-->asp) and Hemoglobin beta^^ val-->ile and Hemoglobin Kansas (beta^o^ asn">thr) were 
prepared similarly except with synthetic oligonucleotides spanning the Sac ll">BQlll. Sai l Spe l. Nco l"> Kpn l and Sad 

45 ">Spel restriction sites respectively (Figure 7). Synthesis of the mutant oligonucleotides, restriction enzyme digestion, 
gel purification, and ligation conditions were identical to those used for Hemoglobin Beth Israel. All mutations were first 
cloned into plasmid pDL ll-lOa, appropriate clones were sequenced, and the mutated beta globin gene was subdoned 
into PstI and Hind lll digested pDL ll-66a. Plasmid sequencing was accomplished as described previously. E. cold celts 
were transformed, cultured, and induced as previously described. FX-hemoglobin mutants were purified by the method 

50 of Exanple 3. Oxygen binding of purified hemoglobin mutants is shown in Table 9. 

Example 5 Production of Synthetic Met- Alpha/Met- Beta Hemoglobin 
Construction of Des-FX Aloha and Des-FX Beta Globin Genes 

55 

We established that E. coll could produce tetrameric fusion-hemoglobin. Elimination of the DNA sequences coding 
for the Factor X^ substrate recognition site on the N-terminal end of each peptide should result in production of synthetic 
(FX free. "des-FX") hemoglobin containing the N-terminal methionine as its only extra residue. pDL lt-62m was digested 
with EfiiRI and EstI to excise the sequence containing the FX-alpha globin gene. The FX-alpha globin gene was then 
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gel purified. pGEM-1 (Promega Corp.) was linearized with EcoR) and Pstl. gel purified, and the FX-alpha globin gene 
ligated into the plasmid as described previously. Clones (FX-alpha pGEM) containing the FX-alpha globin gene inpGEM- 
1 were Identified by digestion of purified plasmids with EqqRI and Psll followed by agarose gel electrophoretic analysis. 
FX-Alpha pGEM was digested with Ndel and Eao l to remove the Factor Xe coding sequence (Figure 5). Oligonucleotides 

5 containing the DNA sequence encoding native alpha globin were synthesized with ends compatible to IMde l and Eao l 
(Figure 8) restriction sites. After synthesis, the oligoncleotides were purified, annealed, and ligated as described above. 
The sequence of pGEM-desFX-alpha (pDL ll-83a) was confirmed by dideoxy-sequencing of the plasmid using the Tj 
promoter primer (Promega Corp., Madison. Wl). A clone containing the correct sequence was then digested with EcqRI 
and Psll. The des-FX alpha globin gene was gel purified, cloned into EcoRI/PstI digested, gel purified pKK-223-3 to 

10 generate pDL ll-86c. E.coli strain JM109 was transformed with the ligation mixtures and expression of des-FX alpha 
globin by individual clones was determined by analysis of total cell protein extracts of induced culture inocula (see above) 
DesFX alpha globin, in contrast to FX-alpha globin, co-migrates with authentic alpha globin on SDS-PAGE. 

The des-FX beta globin sequence was prepared in an analogous fashion using the FX-beta globin gene excised 
from pDL ll-lOa with Pstl and Hind lll. This gel purified beta globin sequence was then ligated into pGEM-1 that had been 

r5 digested with the same two enzymes. A pGEM-1 plasmid containing the FX-beta globin gene was digested with Nde l 
and Sad I. gel purified, and used for construction of the des-FX beta globin gene. The oligonucleotides conferring the 
desired sequence (Figure 20) were synthesized, purified, annealed, and ligated into Nde l. Sac 1 1 cut pGEM FX-beta to 
form pGEM-des-FX beta (pDL lll-6f) as described above. After confirming that the sequence was correct the des-FX 
beta gene was removed with Pstl and Hind lll and gel purified. Des-FX beta was then ligated into the des-FX alpha- 

20 containing plasmid pDL ll-86c using the Pstl/Hindlll sites. Clones containing both des-FX alpha and beta globin genes 
(pDL lll-13e) (Figure 9) were confirmed by EcoR I digestion of purified plasmids (see above), and screened for expression 
by comparison of IPTQ induced and non-induced cultures. Dec-FX hemoglobin co-migrates with native hemoglobin on 
SDS-PAGE. 

25 Characterization of Met-Hgb (Des-FX Hab^ 

Recombinant methionyl-Hgb has different reactivities than Hgb Ao in the presence of chloride arxl phosphate ions 
(Table 10) and with changes in hydrogen Ion concentration (Figure 1 0) . The reason for this is thought to be the additional 
amino acid, methionine, on the N-termini of both globins. The N-terminal amino group of the alpha chain is important 

30 for the change in P50 by phosphate ion. Displacement of the N-terminal amino group in space or changing its electronic 
state by the addition of the methionine can alter these effects. 

The increase in P50 seen in the presence of inositol hexaphosphate ion is irrelevant for any Hgb found in solution 
because the concentration of monophosphate ion found in plasma is not enough to significantly increase the P^q. There 
is no inositol hexaphosphate ion found in plasma. The increase in P50 needed for Hgb in solution to effectively off-load 

35 oxygen can be best attained by incorporating mutations to make Hgb lower in affinity for oxygen. 

The magnitude of change in P50 with respect to chloride ion is not important physiologically The effect, which Is 
interesting in terms of Its biochemical mechanism, does not add a significant anrwunt of oxygen off-loading capacity to 
Hgb in solution. Some animal Hgb have extremely small chloride effects. Again, the chloride effect is thought to be 
transmitted by the alpha globin N-terminal amino group; with methionyl-Hgb that effect could be changed because of 

40 the stearic change or because the pKa of the methionine amino group is different than that of valine. 

The P50 of Hgb is normally most dramatically changed by hydrogen ion concentration. The so-called "Bohr effect" 
is also thought to, in part, involve the N-terminal amino group of the alpha globin. Numerous mutant human Hgb molecules 
have been shown to have altered Bohr effect changes without any physiological deficiency There is an advantage in 
having a limited Bohr effect, as well as phosphate and chloride effects, in a Hgb molecule to be used in solution for a 

45 number of different medical and biochemical applications. In terms of the Bohr effect, there are several applications for 
methionyl-Hgb where it could be used in the more alkaline pH range, e.g., tissue culture, organ perfusion. Figure 10 
depicts a preliminary experiment which indicates that the P50 is actually greater for methionyl-Hgb than for Hgb Ao at 
greater than pH 7.8. It will be noted from Figure 10 that the slope of the Pso-to-pH plot for Met-Hgb is shallower than 
that for Hgb Ao, i.e. the Bohr effect is smaller. Subsequent experiments suggest that the difference in Bohr effect between 

50 Met-Hgb and Hgb Ao is smaller than that shown in Figure 1 0. 

The main advantage for using a Hgb molecule with fixed changes in P50 relative to pH, chloride and phosphate, is 
that the practitioner will know the oxygen off-loading capacity of the formulation without regard for the specific conditions 
of its use. 

55 
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Example 6 : Synthesis of Synthetic Hemoglobin (Des-Val Hgb) of Native Size 
Construction of Pes- Val- Alpha (pDL 11-91 f) and Des-Val-Beta (pDL 1 1 1-1 a) Globin Genes 

DNA sequences encoding the globin genes in which the N-terminal valine codon in each gene is replaced by an 
ATG (methionine) codon were constructed in a manner analogous to the des-FX clones except that the oligonucleotides 
inserted (Figure 8) were ones encoding the amino acid sequences "met-leu. .." and "met-his..." for alpha and beta globin 
genes, respectively. Following confirmation of correct sequence for both des-val alpha (pDL 11-91 f) and des-val beta 
(pDL lt-95a) genes in pGEM-1 , the des-val alpha globin gene was cloned into the Eco RI/PstI cut pKK-223 (see above) 
to create pDL Ill-la. The des-val beta globin gene from pDL ll-95a was then cloned into pDL Ill-la using the Pstl /Hind lll 
restriction sites. 

More specifically, the des-val alpha transfer vector was prepared from plasmid pDL ll-62m as follows. Plasmid pDL 
ll-62m was digested with EcoRI and PstI to excise the fragment containing the FX alpha globin gene. The FX alpha 
globin genewasthen gel purified. The plasmid pGEM-1 (Promega Corp.) was linearized with EcoRI and PstI, gel purified, 
and the FX alpha globin gene ligated into the plasmid as described previously It was necessary to subclone into pGEfvl- 
1 because redundant restriction sites in pKK 223-3 prohibited the removal of the FX coding sequence directly from the 
individual FX-alpha and FX-beta globin genes. Clones containing the FX alpha globin gene in pGEM-1 were identified 
by digestion of purified plasmids with EcoRI and PstI followed by agarose gel electrophoretic analysis. FX alpha pGEM- 
1 was digested with Ndel and EagI to remove the FX alpha coding sequence and oligonucleotides containing DNA 
sequences coding for native alpha globin in which the N-terminal valine is replaced by a methionine were synthesized 
with ends compatible to Ndel and EagI restriction sites. After synthesis, oligonucleotides were purified, annealed and 
ligated as described above. The sequence of pGEM des-val alpha (plasmid pDL 11-91 f) was confirmed by dideodoxy 
sequencing of the plasmid using the T7 pnmer (Promega Corp., Madison, Wisconsin). A clone containing the correct 
sequence for des-val alpha (pDL 11-91 f) was then digested with EcoRI and PstI. The des-val alpha globin gene was gel 
purified, and ligated into EcoRl/PstI digested, gel purified, pKK-223-3 to generate plasmid pDL Ill-la. 

The des-val beta globin transfer vector was prepared in an analogous fashion using the FX beta globin gene of 
plasmid pDL ll-IOa. FX-beta was excised from pDL ll-iOa with PstI and Hindi. This gel purified beta globin sequence 
was then ligated into pGEM-1 that had been digested with the same two enzymes. A pGEM clone containing the FX 
beta globin gene was digested with Ndel and Sad I, gel purified, and used for construction of the des-val beta globin 
gene. The oligonucleotides encoding the desired sequence for des-val beta were synthesized, purif led, annealed, and 
ligated into Ndel. Sacll cut, pGEM FX-beta to form pGEM des-val beta (plasmid pDL ll-95a). After confirming that the 
sequence was correct for des-val beta globin. the gene was removed with PstI and Hindi II and gel purified. 

Preparation of pDL 111-14 c and iii-3g b (des-v a i- a iph a /des V ai beta P oiy cistronic Qen e Clones ) 

Plasmid PDL Ill-la containing the Des-Val alpha globin gene was digested with PstI and Hindll! and gel purified. 
The Des-Val beta globin gene was removed from pDL ll-95a using the same method. Following ligation and transfor- 
mation, individual clones containing the Des-Val alpha/des-val beta globin coexpressing plasmid pDL 111-14 c were ana- 
lyzed for Des-Val hgb production by IPTG induction and SDS-PAGE. 

Plasmid pDL lll-38b which contains the Des-Val beta globin gene 5' to the Des-Val alpha globin gene was then 
constructed and analyzed. 

Plasmid pDL 111-1 a containing the des-val alpha globin gene was linearized with the restriction enzyme Sma l. The 
plasmid was then treated with bacterial alkaline phosphatase to remove the 5'-phosphate groups, phenol extracted, 
ethanol precipitated, and resuspended in TE buffer as above. A pGEM-1 clone containing the des-val beta gene was 
digested with Hind lll, phenol extracted, ethanol precipitated and resuspended in a ligation mixture containing a 50:1 
molar ratio of a Hind lll- Sma l linker to plasmid. This ligation mixture was then digested with Sma l and the beta globin 
fragment now containing Sma l restriction sites on both the 5'- and 3'- ends was gel purified and added to a ligation 
reaction containing the linearized form of plasmid pDL Ill-la, above, thus obtaining plasmid pDL lll-38b. The orientation 
of the alpha and beta globin genes in pDL II I- 14c and pDL lll-38b was confirmed by restriction analysis. 

Cells of E. coli strain JM109 were transformed with pDL Hi-14c or pDL lll-38b and grown in 2xYT media containing 
ampicillin. Colonies were induced with iPTG as above. Individual clones were analyzed for their ability to produce des- 
val alpha and des-val beta globin polypeptides by SDS-PAGE and Western blotting. There was no appredatile differerx;e 
between expression of immunoreactive des-val alpha or des-val beta globins from the alpha">beta orientation (pDL Ill- 
He) or the beta ->alpha orientation (pDL lll-38b). 
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Example 7: Analysis and Comparison of the Functional Characteristics of Recombinant Pes- FX and Des-Val Hemo- 
globins 

JM109 cells expressing either des-FX hemoglobin (dFX-hgb) or des-Val hemoglobin (dV-hgb) were grown to an 
5 OD600 of 15 in a 10 liter fermenter and then irxJuced by the addition of 300^m IPTG. Induction period was for 6 hrs 
Cells were harvested by centrifugation and frozen at -80°C until processed. 

For purification of hgb approximately 200g of cells were resuspended in 350mL of 50mM sodium phosphate (NaPi) 
buffer, pH 7.0 containing 200 units of aprotinin/mL and 20ing/ml DNAase 1. Cells were then lysed by the addition of 
1 mg/mLof lysozyme and 4 passages through a Dynomill. Cellular debris was removed by centrifugation at 10,000 rpm 
10 in a Beckman JA14 rotor at 4°C for 40 min. The supernatant was added to 200mL of a hemoglobin-binding resin equil- 
ibrated with lOmM NaPi, pH 7.0. The pH was adjusted to 7.0 with 10M NaOH or concentrated phosphoric acid. The 
resin was then loaded into a 5x30 cm chromatography column and allowed to settled. The column was then washed 
with 2.5 column volumes of lOmM NaPi, pH 7.0 containing 100 units/mL aprotinln. Hgb was eluted from the column in 
20mM Tris-HCI, pH 7.5 containing 100 units/mL aprotinin. This partially purified hgb was then 0.2 micron filtered and 
T5 loaded onto a 1.6x10 cm Mono-Q anion exchange column equilibrated with 20mM Tris-HCI, pH 8.0. Hgb was eluted 
using a linear gradient of 0 to 0 4M NaCI in 20 mM Tris-HCI, pH 8.0 The material was then loaded onto a 1 .6x10 cm 
Mono-S cation exchange column. Hemoglobin was eluted with a linear gradient of lOmM NaPi, pH 7.0 to lOmM NaPi, 
pH 8.5. ISOmM NaCI. The major peak of hgb was collected, concentrated to approximately 100mg/mL. and used for 
analysis. 

20 Functionality of the recombinant hgb was evaluated using a Hemox analyzer at 25°C in 50mM HEPES, pH 7.4 
containing 0.1M CI . The following oxygen binding data were obtained: 



SAMPLE 


P50 


N 


Ao 


4.03 


2.7 


dFX-hgb 


3.43 


2.6 


dV-hgb 


7.04 
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Of significance in these data are the following: 

1) The addition of an extra amino acid, methionine, on the N-termini of alpha and beta globins (dFX-hgb) appears 
35 to reduce slightly the P50 of the molecule but has little effect upon cooperativity (N). 

2) Replacement of the N-terminal valines of alpha and beta globins with methionine (dV-hgb) increases the P50 of 
the molecule but has little effect upon the cooperativity (N). 

40 Example 8: Polvcistronic Co-expression of Des-Val -Alpha/ Alpha (Di-Alpha) and Des-Val Beta Globin 

The overall synthetic plan for the preparation of a plasmid (pDL III-47a) which co-expresses di-alpha globin and 
beta globin is given below. The starting materials are the commercially available transfer vectors M13mp19-RF. pKK 
223-3 and pGEM-1. and the synthetic oligonucleotides described in the examples. 
45 For convenience, our manipulations began with plasmids, pDL tl-62m and pDL 11-1 Oa. Plasmid pDL tl-62m was 
obtained by cloning the "FX alpha globin" gene into pKK 223-3 downstream of the Tac promoter. Plasmid pDL ll-10a 
was prepared by an analogous insertion of the "FX beta-globin" gene. 

As set forth in greater detail In Figure 14, the "FX alpha globin" operon encodes two cistrons, the first expressing 
an octapeptide "loader", and the second, alpha globin preceded by Met-lle-Glu-Gly-Arg. The latter four amino acids 
so constitute a recognition site for Factor X cleavage activity. The "FX beta globin" operon is similarly constructed. 

Since the Factor X recognition site was not needed here, the genetic material was manipulated to excise the "FX" 
codons. (This couid have been avoided by synthesizing the desired di-alpha globin and des-val-beta globin genesdirectly 
rather than using the FX-alpha and FX-beta genes of pDL ll-62m and pDLII-IOa.) The FX-alpha globin gene cassette 
was excised and cloned into pGEM- 1 to obtain pGEM FX-alpha. Similarly, the FX-beta gene of pDL ll-lOa was transferred 
55 to pGEM- 1 to Obtain pGEM FX-beta. 

The recognition site (FX)-encoding sequence could now be removed from pGEM FX-alpha and pGEM FX-beta to 
obtain pDL ll-91f and pDL ll-95a. respectively. The des-val alpha globin gene of pDL ll-91f was recloned into pKK 223- 
3 to generate pDL Ill-la. the gene being operably linked to the Tac promoter of pKK-223-3. The des-val beta globin gene 
of pDL ll-95a was purified and inserted downstream of the des-val alpha globin gene of pDL Ill-la to form a single 
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transcriptional unit which would encode a polycistronic alpha globin/beta globin mRNA, see pDL lll-14c. Finally, a syn- 
thetic oligonucleotide comprising the desired di-alpha linker encoding sequence and another copy of the alpha globin 
gene was inserted into pDL lll-14c to create pDL lll-47a, wherein a Tac promoter controls transcription of a di-alpha 
globin gene and a des-val beta globin gene. 

Preparation of pDL 1 11-4 7a (di alphaAjeta globin clone) 

The EagI and PstI restriction fragment containing most of the alpha globin gene from the plasmid pDL ll-91f was 
gel purified and ligated to a synthetic DNA linker containing the sequence from the BstBI site of the alpha globin gene 
to its carboxyl terminus, the glycine linker, and the amino terminus of alpha globin to the EagI site (Figure 12). After 
digesting this ligation mixture with Pst I, the resulting fragment was cloned into BstBI/Pstl-cut pDL III-14C to create 
plasmid pDL lll-47a (Figure 13). 

Expression of Di-alpha/Beta Hemoglobin 

Individual E.coli clones were analyzed by Western blotting for production of dimeric alpha globin protein in combi- 
nation with monomeric beta globin. Appropriate plasmid construction was confirmed by digestion with EcoRI and 0.8% 
agarose gel electrophoresis. The EcoRI fragment present in the di-alpha constructs is approximately 1450 bp. 

Expression of genetically fused hemoglobin was accomplished using the IPTG induction protocol and S-sepharose 
purification of recombinant hemoglobin. E.coli cells (400 ml) were grown to an ODgoo of 3.0 and induced with 1 mM IPTG. 
The cells were allowed to continue to grow for another 4 hours and then harvested by centrifugation. The cell pellet was 
resuspended in the ^QmM sodium phosphate. pH 6.0 containing 1 mM benzamidine, 1 mM EDTA and 0. 1% Triton-XlOO. 
The cell suspension was then sonicated, centrifuged at 15,000xg for 15 minutes, and the supernatant loaded on to an 
S'Sepharose column equilibrated with 10mM sodium phosphate pH 6.0. After the sample was loaded on the column, 
the column was washed with 10 bed volumes of lOmM sodium phosphate pH 6.8. The dialpha hemoglobin was eluted 
from the column with 1 0mM sodium phosphate, pH 7.4, 30mM NaCI. Confirmation of hemoglobin production was accom- 
plished by visible light spectroscopy, SDS-PAGE, and Western blot analysis of purified material. 

Example 9: Preparation of Di-Al p h a Hemoglobin Low Affinity Mutants 

In order to reduce the oxygen affinity of recombinant di-alpha hemoglobin, several mutations were introduced into 
the beta globin polypeptides using synthetic oligonucleotides. The restriction sites used to incorporate these mutants 
are shown in Table 3. For insertion of the Nagal (beta Val 67->lle) and Arg-Nagai (also beta Lys 82-> Arg) mutations, 
the des-Val-beta plasmid pDL ll-95a was digested with the restriction enzymes Ncol and Kpnl and gel purified. Oligo- 
nucelotides spanning these two restriction sites and containing the appropriate codon changes were synthesized, puri- 
fied, annealed and ligated into the gel purified plasmid. Following confirmation of correct sequence, the mutant des-Val- 
beta globin gene was excised with PstI, Hindlll, gel purified, and cloned into plasmid pDL lll-47a. The beta globin gene 
containing the Kansas mutation (beta Asn 102=>Thr) was similarly constructed using Sad and Spel restriction sites. 
Mutated codons fOr all of these beta globin mutations are shown in lower case letters in Table 3. 

Example 10: Characterization of Dl-Alpha Hemoglobins 

Oxygen Binding 

Oxygen binding measurements were made at 37''C in a Hemox Analyzer (Southampton, PA). The solutions were 
50mM Bis-Tris, pH 7.4, 0.1 M NaCi and SOuM heme equivalent of di-alpha hemoglobin. The solutions were measured 
between 120 and 1 .5 torr oxygen pressure. P50 values are given In Table 4. 

In Vivo HalfUfe 

Di-Alpha Hgb (wild type) containing a gly-gly linker between alphas and alpha2 was prepared as described previ- 
ously. The protein was formulated in 20mM NaP04, pH 7.4 at a concentration of 95 mg/ml. Di-alpha Hgb was infused 
into male Sprague-Dawley rats 388 - 426 gm) at a dose of 875 mg/kg through a central venous catheter over 20-30 sec. 
Samples of blood were drawn at 2, 30. 80, 90. 120, 1 50, 1 80. 210, and 240 min. into heparinized vials. The blood was 
centrifuged to remove red blood cells and the plasma hemoglobin was assayed by absorbance at 540 nm. The percent 
hemoglobin remaining versus time was determined by comparison to the 2 min time point which was assumed to be a 
honrwgeneously mixed sample. The same experiment was repeated with non-fused des-val Hgb at a concentration of 
100 mgAnl. The data were averaged for each sample and plotted as percent Hgb remaining against time after infusion. 
The measured half-lives were 205 min and 104 min respectively for di-alpha Hgb and des-val Hgb, respectively 
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Example 1 1 : Co-expression of Wild Type Des-Val Alpha Globin and Di-Alpha Globin with Des-Val Beta Globin Containina 
the Presbyterian Mutation 

Plasmid pSGE0,0-E4 is shown in Figure 1 4 and contains the following modifications as compared to other expression 
vectors derived from plasmid pKK223-3. 

1) The plasmid now contains a functional tetracycline resistance gene. 

2) The lad gene which encodes for the lac repressor protein has been incorporated into the plasmid. The lac repres- 
sor protein represses the TAG promoter until induction with IPTG. The repressor gene was inserted into the plasmid 
to permit transformation of E CQli cell lines which do not have endogenous lac repressor genes. 

The desVal beta globin gene containing the Presbyterian mutation was constructed by insertion of a complementary 
pair of synthetic oligonucleotides into the Sad to Spel restriction of sites of pSGE0.0-E4. 

The following oligonucleotides (Pres-A and Pres-B) were used to construct the Presbyterian mutation in dVal-beta 
globin. 

Pres-A 5* CCACTGCGACAAACTGCACGTTGACCCGG (Continued below) 
Pres-B 3 •TCGAGGTGACGCTGTTTGACGTGCAACTGGGCC (Continued below) 

Sad 



Pres-A AAAACTTCCGTCTGCTGGGTaaaGTA 3 • 

Pres-B TTTTGAAGGCAGACGACCCAtttCATGATC 5* 

See I 



Following digestion with the two restriction enzymes the plasmid was gel purified to remove the wild type encoding 
DNA fragment. The annealed oligonucleotides were then ligated into the plasmid. Following transformation of JM109 
cells individual colonies were selected and analyzed for production of alpha and beta globlns by IPTG induction and 
SDS-PAGE Dideoxynucleotide sequencing was used to confirm the presence of the Presbyterian mutation. 

The mutant hemoglobin was produced, purified, and analyzed as described in Example 7 above. The following 
results were obtained: 



SAMPLE 


P50 


N 


Ao 


4.03 


2.7 


dV-hgb 


7.04 


2.8 


dV-hgbPres 


33.0 


2.5 



If should be noted that the Presbyterian mutation, which results in the change of beta asparagine 108 to lysine, 
decreases the affinity of the molecule for oxygen by 10 fold but does not affect the cooperativity of the molecule. 

The Presbyterian mutation has also been co- expressed with di-alpha globin containing a single glycine linker utilizing 
plasmid pSGE 1 .1 -E4 (Figure 1 5). The result shown below indicate that the joining of the carboxy terminus of alpha 1 to 
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the amino terminus of alpha 2 has no effect on oxygen binding and cooperativrty. 



SAMPLE 


P50 


N 


dV-hgbPres 
dialpha/Pres 


33.0 
33.0 


2.5 
2.4 



Example 12: Construction of a Two Promoter System for the Co-expression of Pi -Alpha Globin and Beta Gtobin 

In this example, a di-alpha globin gene is operably linked to one promoter and a beta globin gene to a second 
promoter, but both genes reside on the same vector. Compare Examples 16 and 17, infra . 

Oligonucleotides (see below) encoding the sequence of the complementary strands of the TAG promoter (syn pTAC) 
and appropriate restriction enzyme sites were synthesized, gel purified, and annealed. Notice that the sequence com- 
plementary to the Xbal restriction enzyme site was designed to eliminate this restriction site when ligated into an authentic 
Xbal site. This was done to facilitate future manipulation of syn pTAC. 

Sequence of syn pTAC 
BamHI PstI 

5 ' GATCCTGCAGAGCTGTTGACAATTAATCATCGGCTCCTATAATGTCTGG ( Cont • d ) 

3 ' GACGTCTCGACAACTGTTAATTAGTAGCCGAGCATATTACACACC ( Cont * d ) 

Xbal COMP 
AATTGTGAGCGGATAACAATTTCACAC 3 • 

TTAACACTCGCCTATTGTTAAAGTGTGGATC 5* 

Plasmid pDL lll-47a (Figure 13) was digested with the restriction enzymes BamHI arxi Xbal, and the plasmid, 
now containing as an insert only part of the beta globin gene from the Xbal site to the Hindi 11 site (Figure 13a), was gel 
purified. 

Syn pTAC was then ligated into the plasmid to create plasmid pDL IV-64a and <JW09 cells were transformed. Indi- 
vidual transformants were isolated and analyzed by IPTG induction and SDS-PAGE to confirm the presence of a func- 
tional TAG promoter. 

Plasmid pDL lll-47a was digested with the restriction enzymes PstI and Hindlll and gel purified to remove the beta 
globin coding sequences. Plasmid pDL IV-64a was digested with these same enzymes and the fragment encoding syn 
pTAG/beta gel purified. Ligation of the syn pTAG/beta fragment into the Pstl/Hindlll digested pDL lll-47a created plasmid 
pDL IV-67a (Figure 16). Irxjividual transformants were screened for production of di-alpha and beta by IPTG induction 
and SDS-PAGE. 

The dVal-beta globin gene under the control of syn pTAC was also ligated into another location in pDL lll-47a. Syn 
pTAC/beta was removed from pDL IV-62a by digestion with Hind III and PstI. The restriction site overhangs were filled 
with T4 polymerase and blunt end ligated Into the Pvu II site of PDL lll-47a to create plasmid pJR Vl-54a (Figure 1 7). 
IPTG induction and SDS-PAGE analysis were as previously described. 

Results of induction experiments as evaluated by SDS-PAGE indicated that control of expression of di-alpha and 
beta by separate promoters gave little increase in the expression of either protein. Similarly, insertion of a second beta 
globin gene under regulation of a separate promoter had little effect upon production of the proteins. 
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Example 13: Hypothetical Protocol for Inse rtion of a Second Translationallv coupled Beta Globin Gene Into the Di- 
Al p ha/B ^ a Ex p ression Pl a smi d 

Plasmid pDL lll*47a (Figure 13) is digested with Hindlll and PstI and the Hindlll/PstI fragment gel purified. The 
restriction site overhangs are then filled by T4 polymerase. The same plasmid is digested with Hindlll and filled in with 
T4 polymerase. The beta globin gene is then blunt end ligated into the plasmid (Figure 18). Following transformation, 
individual clones can be analyzed by restriction digest to confirm the presence and orientation of the inserted beta globin 
gene. IPTG induction and SDS-PAGE analysis can be used to evaluate production to di-alpha and beta globins. 

Example 14: Hypothetical Protocol for Expression of Pi-Beta Globin 

A di-beta globin gene, analogous tothe di-alpha globin genedescribed above can be constructed asfbliows. Plasmid 
pDL ll-95a is a plasmid containing the des-val beta globin gene in the pGem-1 (Promega, Madison, Wl) vector. pDL II- 
95a would be digested with the restriction enzymes Psil and BspM H and the large linear DNA sequence isolated by gel 
purification. A Separate pDL ll-95a plasmid would be restriction digested with PstI and Nhe l. and the small Pstll to Nhe ll 
fragment gel purified. Oligonucleotides encoding the di-beta linker containing the C-terminal amino acids of beta, and 
the N-terminal amino acids of beta2. connected by a linker peptide of variable length (See Example 15) would be syn- 
thesized, gel purified and annealed as described previously The generic sequence for such an oligonucleotide connect- 
ing the C-terminal his of beta, to the N-terminal val of beta2 would be: 

NhQ l 

BsdM II 
His Val 
5 ' -CTAGCTCACAAATACCAC(XXX)„GTTCACCTGACT-3 ' 

3 • -GAGTGTTTATGGTG<XXX) „ CAAGTGGACTGAGGCC-5 • 



where the "XXX" indicates the codon(s) for the amino acid(s) linking the two beta globin peptides. Since the two beta 
globin termini are separated by approximately 1 8 angstroms in the deoxy state, it is anticipated that "n" would be in the 
range of 5 to 9. In the above example the oligonucleotide sequence spans the Nhe I restriction site, through the cartx)xyl 
terminal histidine of betai, a variable length coding sequence containing the amino acid linker to create di-beta, the 
amino terminal valine of beta2 and the BspM II restriction site in the beta2 gene. 

To construct the plasmid encoding the dl-beta polypeptide, the unphosphorylated di-beta linker DNA with Nhel and 
BspM II restriction sites at its termini would be ligated to the PstI /NHe l fragment. Following ligation the dimeric forms 
resulting from the ligation of the Psjl sites would be digested with PstI to produce fragments containing only PstI and 
BsdM II termini. These fragments would then be subcloned into the Pst l /BspM II cut plasmid. Following transformation, 
clones containing the di-beta construct would be confirmed by PstI and Hind lll restriction fragment analysis. The linker 
region would be sequenced and the appropriate constructs subcloned into the expression plasmid containing either the 
di-alpha globin gene or a single alpha globin gene. E. coli would be transformed in the previously described fashion and 
assayed for production of hemoglobin as described before. 

Example 15: Hypothetical Protocol for Development of Linkers bv Mutation and Selection 

In this hypothetical example, the linker is obtained by mutagenesis of a linker-encoding DNA sequence and selection 
for functional linkers. Oligonucleotides spanning the BstBI site of alphai to the EaqI stte aipha2 will be synthesized such 
that the six nucleotides comprising the preferred glycine-glycine linker are randomized. By randomizing these nucle- 
otides, codons for all combinations of amino acids wilt be present in the oligonucleotide mixture. Following purification 
and annealing of the oligonucleotides they will be used to construct the di-alpha/beta co-expression genes as described 
above. 

Clones containing the various di-alphaTbeta plasmids will then be screened for production of increased levels of 
recombinant hemoglobin using a protocol developed at the Company. E. coli clones will be an-ayed on nitrocellulose 
filters overlayed on 2xYT-ampicillin plates containing 1mM IPTG. Following overnight incubation of 37*C, the plates will 
be sealed in a plastic bag in which the air has been displaced by carbon monoxide (CO). CO binding to intracellular 
recombinant hemoglobin produces a distinctive red color in the E.coli colonies. Colonies producing the most intense red 
color will be further analyzed. 

In this experiment the assumption is made that certain combinations of amino aads in the di-alpha linker will permit 
more stable folding of the individual, linked alpha globin chains and, therefore, result in greater levels of production of 
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intracellular recombinant hemoglobin. This inaease level of production will result in a more intense red color in the 
appropriate E.coli clones. 

After selection of several clones producing higher levels of recombinant hemoglobin, more detailed analyses will 
be done on individual clones to determine the optimal di-amino acid linker. The analyses will include determination of 
quantities of recombinant hemoglobin produced, oxygen affinity, and protein stability. Finally, clones found to be produc- 
ing the best quality recombinant hemoglobin will be DNA sequenced to determine the amino adds comprising the linker. 

Example 16: Hypothetical protocol for the synthesis of olasmids containing aloha and beta clobin oenes under the 
regulation of two separate p romoters on the same plasmid. 

It is anticipated that recombinant hemoglobin can be expressed from constructs where the different globin genes 
are under the control of separate promoters. This situation would yield two separate mRNA's; one with a dicistronic 
sequence encoding an alpha globin gene and another with a dicistronic sequence encoding a beta globin gene. For 
construction of an expression system in which both the alpha and beta globin genes are under the regulation of separate 
promoters, on the same plasmid, the following protocol would initially be used. Plasmid pDL Ill-la containing the des-val 
alpha globin gene would be digested with the restriction enzyme Bam HI. reacted with bacterial alkaline phosphatase, 
phenol extracted, ethanol precipitated, and resuspended in TE buffer. The plasmid pJR llll 50-a which is the pKK expres- 
sion plasmid containing the des-val beta construct would then be digested with the restriction enzymes Bam HI and Pyyl, 
to excise a fragment from the plasmid containing the Ptac promoter, the des-val beta sequence, the transcriptional ter- 
minator sequence and a portion of the ampicillin resistance gene. Following gel purification of this fragment, a Pyul- 
Bam H I linker would be synthesized and ligated onto the insert. The insert would then be back-cut with Bam HI to generate 
Bam HI compatible sites on both the 5.- and 3'- ends of the insert. This insert would then be cloned into Bam HI linearized 
plasmid pDL Ill-la. resulting in a plasmid in which a translationally coupled des-val beta globin gene under regulation of 
one Ptac promoter is positioned on the 3' side of a translationally coupled des-alpha globin gene under regulation of a 
separate P,ac promoter. Restriction enzyme mapping would be used to confirm the orientation of the beta globin con- 
taining insert. E. coli JM-1 09 would be transformed with the plasmid containing the separate P,ac -globtn constructs and 
grown in media containing ampicillin to isolate clones containing the plasmid. The clones containing the plasmid would 
then be induced with IPTG and expression of des-val alpha globin. des-val beta globin and des-val hemoglobin would 
be assayed by SDS-PAGE analysis, Western blotting with anti -hemoglobin antibodies and isolation of des-val hemoglobin 
by standard chromatographic methods. 

Alternatively, coexpresslon of both globin genes could be achieved from DNA sequences on separate vectors under 
the control of separate promoters. 

Example 17: Hypothetical Protocol for the Construction of Vectors Containino aloha and beta globin genes under the 
re gulation of separate promoters and on different vectors. 

E. coli clones containing plasmid pDL Ill-la which is the pKK223-3 plasmid containing a dicistronic loader gene/des- 
val alpha construct under the regulation of the Piac promoter would be transformed with a plasmid containing the dicis- 
tronic loader gene/des-val beta construct under control of the same promoter, but with a gene conferring additional 
antibiotic resistance to tetracycline. This could be constructed in the following manner: Plasmid pJR IV-50a contains the 
des-vat beta globin gene under control of the Piac promoter. This plasmid would be cut with Pyuli to generate a linear 
plasmid with blunt ends. This would be ligated with a NotI phosphorylated linker (New England BioLabs). The ligation 
mixture will be used to transform E eQli Plasmid DNA would be prepared and plasmids containing a NotI site identified 
by digestion with NotI agarose gel electrophoresis. This plasmid will contain the Ptac des-val beta globm sequence. The 
gene for resistance to the antibiotic kanamycin is commercially available (Pharmacia) and contains EcoRI restriction 
sites on both ends. The ends will be converted to blunt ends by treatment with T4 DNA polymerase by the method of 
Maniatis. et aL. The resulting fragment will be ligated with a 50 fold excess of phosphorylated NotI linker (25°C, 60 min). 
The ligation reaction wouW be made 0 01 M in EDTA, heated to 70*' for 20 min and ethanol precipitated The precipitated 
DNA will be taken up in 100 ul of NotI buffer and treated with 100 units of NotI (37*^) for 2 hr. The fragment would be 
purified by agarose gel electrophoresis The NotI adapted kanamycin resistance gene would then be ligated into the 
Not I linearized pJR IV-50a to yield a plasmid with the gene for des-val beta globin under control of Ptac with kanamycin 
resistance. E.coli JM-1 09 clones containing plasmid pDL II 1-1 a, the plasmid containing des-val alpha globin under control 
of Ptac with resistance to ampicillin, will then be transformed with kanamycin resistant plasmid containing the gene for 
des-val beta globin and clones will be selected for resistance to both ampicillin and kanamycin. Other antibiotic resistance 
genes could be used as well. Expression of alpha and beta globin polypeptides under the regulation of separate pro- 
moters will then be analyzed by IPTG induction, SDS page and western blotting. 

Should we encounter problems with plasmid exclusion, we could use the same strategy with the pIN plasmids that 
have been used to express polypeptides from separate plasmids In E^cqU (McNally. fit aL. PNAS 85. 7270. 1988). 
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One potential problem that we may face with creating plasmids in which the alpha and beta globin genes are under 
separate but identical promoters is the possibility of homologous recombination within the identical sequences on the 
plasmids, eg. the promoter region. This could result in deletion of a segment of important DNA sequence. It is therefore 
preferable to use different, non-homologous promotors for each different globin gene, eg. P^^ and p,rc, or the lambda 
Pl promoter in appropriate host {containing cl857). 

Example 18: Synthesis and Assembly of the Di-Alpha Beta Globin Construct m aPL Regulated Vector System 

In prior examples, the globin genes were under Tac promoter control, and the alpha (of di-alpha) and beta globin 
genes were each translationally coupled to the ribosomal loader cistron taught by Schoner, et al. In this example, the 
lambda Pi_ promoter and a different translational coupler (see below) are used. 

Translational Coupler 



SD2 JlEI 

5'AAT AAG GAG GAA TAA CAT ATG CTG TCT CCG GCC GAT (Cont'd) 
3*TTA TTC CTC CTT ATT GTA TAC GAC AGA SfiS QSSl £TA (Cont'd) 

EAGZ 



AAG GCC CCA AGC TTG GGG3 * 
TTC CGG GG T TCG AA g CCC5 • 
Hind III 

The pL expression system has a different translational coupler as compared to the pTAC system. Sequences 
coding for the two SD's and the translational stop were added onto the 3' end of the N protein coding sequences to act 
as a translational coupler. Subsequent to that the globin coding sequences are identical to that used in the pTAC system. 

Using the pPL-lambda vector available from Pharmacia, a plasmid construct was assembled to generate a genet- 
ically crossed-linked tetrameric human hemoglobin (wild type) in E. coli strains N99Ci+ and N4830-1 (cl857). These 
bacterial strains were obtained from Pharmacia and are inducabie by addition of naladixic acid (40Mg/ml) or mitomycin 
C (10|ig/ml) in the presence of the wild type Ci-i- repressor or heat treatment of the strain containing the cl857 repressor 
gene (Mott, et al.. pNAS 82. 88, 1985 and Gottesmann, M.E., et al.. J.Mol. Biol. 14Q. 57. 1980). A diagramatic repre- 
sentation of the cloning strategy is depicted in figure 19. 

Removal of the Eaal Site from the pRL-Lambda Vector 

Removal of the EagI site from pPL-lambda was necessary to enable cloning of the dl-alpha gene sequence, because 
both alpha structural genes contains a EagI site located 6 bp into the coding sequence. The pPL-lambda vector was 
digested with EagI. and the ends were filled using T4 DNA polymerase. The Bam HI linker (5'-CCCGGATCCGGG- 
3')(Pharmacia). was blunt-end ligated to the EagI digested pPL-lambda plasmid by standard methods. This eliminated 
the EagI site in the desired construct. The resulting mixture was digested with EagI to eliminate any plasmids still con- 
taining the EagI site. E. coir N99Ci+ cells were transformed with resulting plasmid, pPL-lambda-E. Clones containing 
the desired plasmid were identified by restriction digest analysis. 

Incorporation of the Synthetic Co-Translational Coupler into pPL-Lambda-E 

Prior to inserting the globin genes into the vector It was necessary to incorporate the synthetic translational coupler 
sequence into the Hpal site of pPL-lambda-E. This was done by digestion of pPL-lambda-E with Hpal followed by blunt- 
end ligation of the co-translational coupler into the Hpal site of the vector Ligation of the coupler to the blunt end resulted 
in destruction of the Hpal site. The ligation mixture was treated with Hpal to digest any plasmid remaining containing 
the Hpal site. E. coli N99Ci+ cells were transformed with the resulting reaction mixture. Clones were screened with 
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EcoRI and Hind III restriction digests to identify clones containing the co-translational coupler In the proper orientation. 
DNA fragments of 522 bp and 4762 bp were observed for plasmid containing the desired orientation. To confirm the 
orientation of the coupler, the resulting plasmid was sequenced using a primer (5'CAATGGAAAGCAGCAAATCC-3') 
corrplementary to the sequence 30 base pairs upstream from the translational coupler sequence. The desired plasmid 
was denoted as pPL-lambda-E+TC. 

Construction of an Expression Plasmid Containing Des-Val Alpha and Beta Genes Under Control of pPL-Lambda 

The Des-Val alpha and beta globin genes were obtained from pDL 1 11-1 4c by digestion with EagI and Hind III, followed 
by agarose gel purification of the desired 942 bp segment. The purified alpha and beta globin gene fragment was cloned 
into EatI and Hind III digested pPL-lambda-E+TC. The ligation mixture was used to transform E.coli N99Ci+. and clones 
wre screened for the presence of the desired plasmid, pPL-alphaAieta with EcoRI (4758 and 1468 bp), and with PstI 
and Hind III (4005, 1 775. 520 bp) confirmed presence of the desired restriction sites. Further confirmation was obtained 
through sequencing with the 20 bp primer, above, to confirm the sequence between the co-translational coupler and the 
Des-Val alpha globin gene, and a second primer (5'ACCCGGAAAACTTCCGTC-3") to confirm the sequence between 
Des-Val beta and the pPL vector. 

Construction of an Expression Plasmid Containing Di-Aipha and Beta Globin Genes Under control of pPL-Lambda 

An RGV linker which encodes for the carboxy terminal portion of alpha globin. linked via a single glycine residue to 
the native sequence of the amino portion of a second alpha globin chain, was prepared by separately phosphor ylating 
the 5' ends of 5'CGAAATAACGTGGTGTTCTGTCTGC-3' and 3'TTTATGGCACCACAAGACAGACGCCGG-5" with T4 
kinase, followed by annealing. This dout^le stranded oligonucleotide was cloned onto the EagI end of a purified fragment 
of Des-Val alpha and beta globin prepared from pDL lll-14c digested with EagI and Hindlll, as described above. The 
linear DNA sequence generated from this ligation, now containing sticky ends coding for BstBI and Hindlll restrcition 
site sequences, was purified by agarose gel electrophoresis and cloned into BstBI and Hindlll digested pPL-alpha/beta. 
The new plasmid designated pPL-dialpha/beta contained a sequence with a co-translational coupler upstream to a 
sequence containing di-alpha globin linked via a glycine residue, followed by a ootranslational coupler adjacent to a beta 
globin gene sequence, all under conrol of a sinle Pl promoter. These clones were identified through screening minipreps 
with EagI restriction digestions. Clones without the second alpha globin gene merely linearised upon digestion, whereas 
clones containing the second gene released 431 bp and 6222 bp DNA fragments. 

Construction of the Expression Plasmid pSGEO.I-LO Containing a ROP- Origin of Replication Mutation 

pPL-dialpha/beta was digested with Pvull then treated with T4 DNA polymerase to fill in the sticky ends. The line- 
arized plasmid was then blunt end ligated witha NotI linker (Promega Corp.. Madison. WI)(5TrGCGGCCGCAA-3'). The 
ligation mixture was then treated with Pvull to remove any remaining plasmid containing the Pvull site. E. coli were 
transformed with pSGE0.1-L0 and positive clones were identified by the presence of the unique NotI restriction site. 

Expression of Hemoglobin SGE0.1 in E Coli Under the Control of the Lamtxja Pl Promoter 

E. coli N99Ci+ and E. coli N4830-1 were transformed with pSGEO.1 -LO and grown on agar plates containing amp- 
icillin, as described previously. These E. coli strains contain the cl+ repressor gene and the cl857 heat sensitive repressor 

gene, respectively. 

Inocula of N99Ci-i- were grown at 37°C in TB media to an ODeoo of ~ 1 .0 and induced with naladixic acid (40pLg/ml). 
Cultures were incubated for 4-6 hrs. at 37"C before the cells were harvested Hennoglobin production was estimated by 
SDS-PAGE analysis of total cell protein and by western blot analysis. By these techniques. SGE0.1 was estimated to 
be produced at ~0.02% of the total cell protein in this cell line. Hemoglobin {57\lq) was isolated by Mono Q chromatog- 
raphy and shown to have an optical spectrum representative of that for normal hemoglot^in. 

Inocula of N4830-1 were incubated at 30°C in TB media to an ODeoo of 1.0 and induced by addition of sufficient 
preheated TB media (65"C) to raise the temperature of the inocula TO 42''C The culture was then incubated at 42° for 
4-6 hrs. Total cell protein analysis with SDS-PAGE revealted that SGE0.1 was being synthesized at 0.4% of the total 
cell protein, corresponding to 0.18 mg protein per gram of wet cell paste. SGE0.1 prepared from a 2L preparation was 
purified as described elsewhere, and resulted in isolation of 7.6 mg of purified material. SGE0.1 (7.6mg) was isolated 
and had a P50 of 7.08 (Hemox Analyzer, pH 7.4. 0.1 M NaCI. 37*'C). and an optical spectrum representative of native 
hemoglobin. 
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Example 19 Production of Hemoglobin in Yeast 

All restriction enzymes and DNA-modifying enzymes were purchased from BRL, New England Biolabs, 161, Phar- 
macia or Boerhringer-Mannheim. The concentrations of enzymes used were those suggested by the supplier to produce 
a complete reaction in 30 minutes. The buffers and conditions for the use of these enzymes were those provided with 
the enzymes, unless otherwise stated. Plasmid DNA was purified from E.coli DHSa as described by Birnboim and Doly 
(Nucleic Acids Research 1979. 7:1513-1520). Electrophoretic analysis of DNA was carried out in agarose gels using 
tris-acetate electrophoresis buffer (Maniatis etal. Molecular CloninQ . Cold Spring Harbor, NY. 1982). DNA was visualized 
by staining the gels with 0.5^g/ml ethidium bromide and exposing the gel to ultraviolet light. DNA fragments were purified 
from agarose gels using a kit purchased from BIO-101 . DNA fragments were purified from aaylamlde gels by crushing 
the excised gel fragment, containing the DNA, of interest in 3.25M ammonium acetate and incubating overnight at 37'C. 
Gel fragments are removed by centrifugation (12,000 x g, 15 min) and the DNA precipitated with 2 volumes of 95% 
Gthanol, 5% isopropanol. The precipitate is dried in vacuo and dissolved in 0.1 XTE (1XTE is lOmM Tris. HCI pH7.8, 
1 mM Naa EDTA), Acrylamide gel electrophoresis of DNA was done as described by Maniatis, et al. ( Molecular Cloning . 
Cold Spring Harbor, NY. 1982.) in tris-acetate electrophoresis buffer, Bacteriological grovrth media and DNA transfor- 
mation methods are described by R W Davis et al. (Advanced Bacterial Genetics, Gold Spring Harbor Laboratory, New 
York. 1980, p140-141). Transformation of S. cerevisiae with linear or circular DNA was carried out as described by H. 
Ito et al. (J. Bacteriology 153:1 63-168 (1 983)). Transformants were selected on SD medium lacking uracil or tryptophane 
(SD-ura, SD-trp) depending on the selectable marker on the plasmid (F. Sherman et al., Methods in Yeast Genetics: A 
Laboratory Manual, Cold Spring Hariaor Laboratory, 1979). All other yeast media used are described by Sherman el al. 
(ibid.) 

S YNTHESIS AND ASSEMBLY OF A GALACTOSE REGULATED PROMOTER 

This synthetic promoter consists of two functional parts, a regulatory sequence and sequence that allows efficient 
initiation of mRNA synthesis. One of the regulatory regions we chose includes the nucleotide sequence that confers 
positive regulation of transcription in the presence of galactose (M. Johnston and R. Davis. 1984. Molecular and Cellular 
Biology 4:1440-1448; L. Guarente et al., 1982. Proc Nat Acad Sd (USA) 79:7410-7414.). The transcriptional initiation 
site is derived from the consensus sequence for the S.cerevisiae glyceraldehyde-3-phosphate dehydrogenase gene 
(QAP491) (L McAlister and M.J. Holland. J. Biol Chem 260:15019-15027, 1983; J.P Holland et al., J. Biol Ghem 
258:5291-5299, 1983). 

The synthetic oligonucleotides shown in figure 1 were synthesized on a Biosearch 8600 DNA synthesizer. Each 
oligonud eotide was cleaved from its support column with 28% NH4OH . The blocking groups were removed by incubating 
the cleaved oligonucleotide in 28% NH4OH at 65'C for ^ 16hr. All oligonucleotides were purified by preparative poly- 
acrylamide gel electrophoresis in slabs of 10% acrylamide (19:1 acrylamide:bis-acrylamide) containing 7M urea. Oli- 
gonucleotides were eluted from acrylamide slices by incubation (16hr) in 50mM ammonium acetate (pH7.4), 2.5mM 
magnesium acetate, 0.25mM EDTA and 0,25% SDS at 37''C. Acrylamide fragments were removed by centrifugation 
(14.000xg, lOmin) and the oligonucleotide precipitated from the aqueous phase by the addition of NaCI to 0.25M and 
3 volumes of 100% ethanol. The precipitated oligonucleotides were collected by centrifugation (14,000xg for 30min). 
washed twice with 80% ethanol. once with 100% ethanol and the pellets dried. Pellets were dissolved in 0.1 XTE. 2 x 
lO'io moles (each) of oligonucleotides 1-5 (Table 5) were phosphorylated in 0.02 ml of 0.066M Tris HCI (pH 7.6), 0.01 M 
MgCl2, 0.002M dithiothreitol (DTT), 0.001 M spermidine and 10 units of T4 polynucleotide kinase. Phosphorylation reac- 
tions were carried out at 37°C for 30min and terminated by heating to 96°C for 5 min. 2 x 10"^ 0 moles of oligonucleotides 
1 and 6 (Table 5) were added to the phosphorylated oligonucleotides in af inal volume of 0.04ml of 0.07M Tris HCI (pH7.6) 
0.01 M MgCl2 The mixture of oligonucleotides 1 -6 (Table 5) was heated to 96''C for 5 min, 75°C for 20 min, 55°C for 30 
mm, 37*0 for 60 min and 25°Cfor 15 min. T4 DNAIigase(10 units). DTT (0.002M final concentration), and ATP (0.001 M) 
were added and the mixture incut)ated at 4*'C for 1 6hr The resulting 21 0 bp oligonucleotide contains a 5' end compatible 
with a Sail restriction endonuclease site and a 3" end compatible with an Xbal site. Because the oligonucleotides com- 
prising the two ends of the intact oligonucleotide were not phosphorylated, they cannot ligate to each other. This oligo- 
nucleotide was cloned into the vector pSK+ (Stratagene, Inc.) (Figure 21 (a)) that had been digested with Xbal and Sail. 
The ligation mixture contained 50ng of Xbal, Sail digested pSK(+), and 5pMoles of the ligated oligonucleotide in a volume 
of 0.01ml. E.coli DHSa was transformed with a portion of the ligation reaction and clones that contain inserts were 
identified by screening for white colonies on LB-ampicillin (0.15mg/ml) agar plates supplemented with XGAL (4(ig/ml). 
Positive identification was made by preparing plasmid DNA from these isolates and digesting with Xbai and Sail. The 
restriction digests were analyzed by agarose gel electrophoresis and three clones containing a fragment of the expected 
size (~ 250bp) were Identified. The DNA sequence of all three clones was determined and one was chosen for further 
use and designated pGS2488 (Figure 21(a)). 
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ASSEMBLY OF THE SYNTHETIC GALACTOSE UPSTREAM ACTIVATOR (GALuas) SEQUENCE 

The otlgonucleotides shown in Table 6 were synthesized and purified as described above. Oligonucleotides 2-5 
were phosphorylated, annealed with oligonucleotides 1 and 6 and ligated as described for the assembly of GAR The 
full length oligonucleotide generated by this protocol has non-phosphorylated ends compatible with the restriction endo- 
nuclease sites generated by SphI and Sail. However, when the oligonucleotide is ligated to a Sail site the resulting 
junction formed between the two fragments will no longer contain a cleavable Sail site. 

The GALyAs is contained on an SPHI - Sail fragment. To clone this fragment into pGS2488 required that we change 
the Kpnl site of this plasmid to an SphI site. The plasmid pGS2488 was modified by cleaving with KpnI. The Kpnl digested 
plasmid was incubated with 2 units of T4 polymerase in 0.05ml buffer containing 50^M of each deoxyritx3nucleotide 
triphosphate (A.G.C.T), 0.033M Tris-acetate (pH 7.9), 0.066M potassium acetate, 0.01 M magnesium acetate, 0.5mM 
DTT and lOO^g/ml bovine serum albumin (BSA). NaaEDTA was added to 0.0 15M and the mixture extracted IX with 
phenol-chloroform. DNA was precipitated with ethanol. The dry pellet was dissolved in 0.008ml of T4 DNA ligase buffer, 
50ng phosphorylated SphI linkers {New England Biolabs) and 10 units of T4 DNA ligase. The mixture was incubated 
for 1 hour at 25"C and used to transform E.coli DH5a. Plasmid DNA was prepared from 12 transformants and tested by 
restriction enzyme digestion and agarose gel electrophoresis, for the presence of an SphI site and the absence of a 
Kpnl site. A clone containing a plasmid with these characteristics was identified and the plasmid was designated 
pGS2888 (Figure 21(b)). 

The next step in the assembly of this hytxid promoter was to clone the SphI - Sail fragment containing the GAL^as 
into pGS2888. pGS2888 was digested with SphI and Sal!, phenol-chloroform extracted and ethanol precipitated. Fifty 
nanograms of SphI, Sail digested pGS2888 was incubated with 25ng of the annealed, ligated GALy^s mixture in 0.005ml 
1 X ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated overnight at 4°C and a portion 
used to transform E.coli DH5a. Ampiclliin resistant clones were isolated and plasmid DNA prepared. The plasmid DNA 
(digested with Xbal and Spl) was analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the 
expected size (~ 500bp) was Identified. The sequence of the putative GAL^^s portion of this plasmid was determined 
and the plasmid was designated pGS4788 (Figure 21(b)). The conplete sequence of the synthetic GALGAP promotor 
(pGGAP) Is shown in figure 20. 

CONSTRUCTION OF A oGGAP-S-GLOBIN EXPRESSION CASSETTE 

The plasmid pLCIlFX-p-giobin (K. Nagal. M. Perutz and C. Payart, Proc Nat Acad Sci (USA) 82:7252-7255) was 
used as the source of human p-globin cDNA. The coding region of the p-globin cDNA can be excised as an ApaLI to 
Hind3 fragment that is missing only the first four nucleotides of the p-globin coding (translated) sequence. pLcFX p- 
globln (5ng) was digested with ApaLI and Hind3 and a 550 bp fragment containing the cDNA was purified by acrylamide 
gel electrophoresis. The ApaLI - Hind3 fragment containing the p-g!obin cDNA was cloned into Xbal, HIndS digested 
pUCl9 using the following adaptor (synthesized and purified as described above): 

Xbal Mcol ApaLI 
YHla/b5 • -CTAGAACCATGG 

TTGGTACCACGT-5 • 



The two oligonucleotides were mixed (12.6ng of each), NaCI was added to 0.25M and three volumes of ethanol 
(anhydrous) was added. The precipitated oiigonucleotides were collected by centrifugation (14,000 x g, 1 5 min) and the 
pellet washed twice with 80% ethanol, once with anhydrous ethanol and dried in vacuo. The adaptors were dissolved 
in 1ml of 0.1 XTE. Fifty nanograms of Xbal-HindS fragments containing the p-globin cDNA and 12.5ng of the ethanol 
precipitated (non-phosphorylated) YH1a,b oligonucleotide in 0.010 ml of T4 DNA ligase buffer containing 10 units of T4 
DNA ligase. The ligation mixture was incubated at 4<*C for 16 hr. and a portion used to transform E.coli DH5a. Trans- 
formants were selected on LB-anpicillin plates containing 4^gATtl XGAL. Plasmid DNA was prepared from 1 2 white 
colonies. The DNA was digested with Nco1 or ApaLI and analyzed by agarose gel electrophoresis Four of these colonies 
contained plasmids with the expected restriction fragments and one was designated pUCl9p-globin (Rgure 21 (a)). The 
plasmid pSUC2-6L (Q. Stetler et al. Biotechnology Z:55-60 (1989)) (Figure 21(a)) was digested with HindS and Xbal 
and the large fragment was purified by agarose gel electrophoresis. The Xbal to HindS fragment containing p-globin 
cDNA was also purified (agarose gel electrophoresis) from pUCl9p'globln. Ten nanograms of gel-purified Xbal . HIndS 
digested pSUC2-6L was mixed with 16 nanograms of the Xbal , Hind3 fragment from pUCl9p-globin in 0.01 ml of ligase 
buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated at 25*^0 for 1 hr. and a portion used to 
transform E.coli DH5a. Transformants were selected on LB-ampiclllIn medium and three were used to prepare plasmid 
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DNA. These were analyzed by digestion with EcoRl and analyzed by agarose gel electrophoresis. Two of these con- 
tained plasmids with the expected restriction fragments and one was designated pGS1 188 (Figure 21(b)). This plasmid 
contains p-globin under the transcriptional control of the sigma promoter and contains the 3* transcriptional termination 
signals and polyadenyiation signals of the MFal gene. 

REPLACEMENT OF THE SIGMA PROMOTER WITH pGGAP 

The plasmid pGSl 1 88 was digested with SphI and Ncol and the vector plus p-globin cDNA was separated from the 
Sigma promoter by agarose gel electrophoresis and purified as described previously Plasmid pGS4788 (1 Oug) was also 
digested with SphI and Ncol and the 500bp fragment (containing pGGAP) produced by this digest was purified by 
agarose gel electrophoresis Fifty nanograms of the gel-purified p-globin containing vector was incubated with 50 ng of 
Ncol-SphI fragment containing the pGGAP promotor in 0.01ml of IX ligase buffer with 10 units of T4 DNA ligase. The 
mixture was incubated for 1 hour at 25°C and a portion of the ligation mixture used to transform E.coli DH5a. Ampicillin 
resistant clones were selected. Plasmid DNA isolated from 12 of these clones was analyzed by digestion with Sph) and 
Ncol to Identify plasmids containing the ~ 500 bp GGAP promoter. The presence of the p-globin cDNA was confirmed 
by digestion with Xbal and Sail, followed by agarose gel electrophoresis analysis. A plasmid containing the expected 
fragments was Identified and designated pGS3588 (Figure 21(b)). To aid the subcloning of this fragment into a yeast 
vector, the Smal site of pGS3588 was converted to a Xhol site as follows: 1 ^g of pGS3588 was digested with Smal, 
the digest was extracted once with phenol -chloroform and ethanol precipitated. The precipitated DNA was dissolved In 
T4 DNA ligase buffer, with lOOng of phosphorylated Xhol linker and 10 units of T4 DNA ligase (final volume of 0.01ml). 
The ligation was incubated at room temperature for 2 hr and a portion of the ligation mixture was used to transform E.coli 
DH5a (excess linkers were not removed prior to transformation). Amplclllln resistant clones were isolated and plasmid 
DNA prepared. Plasmids containing the additional Xhol site were identified by digestion with Xhol and agarose gel 
electrophoresis analysis. A plasmid containing the pGGAP -p-globin expression cassette was identified and has been 
designated pGS3888 (Figure 21(b)). 

CONSTRUCTION OF AN a-GLOBIN EXPRESSION CASSETTE 

We obtained a partial length cDNA (pa-MRC) clone from K. Nagai (MRC. Cambridge). To adapt the cONA encoding 
a-globin for expression from the pGGAP promotor, two oligonucleotide primers were synthesized (synthesis and purifi- 
cation of oligonucleotides was as described above). 
a-1 : 5'-GAATTCCATGGTGCTGTGTCGTGGGGACAAGACC-3'. 
a-2:5-CTGCAGTCGACTTAACGGTATTTGGAGGTCAGCACGGTGGT-3*. 

These two oligonucleotides were used as primers for a polymerase chain reaction (PGR) (R.K. Sakai et al., 1985. 
Science 1350- 1354) using a Perkin Elmer-Cetus PGR kit and pa-MRC as template, a-globin cDNA (8.3ug/ml) in 
0.018 ml H2O was denatured by the addition of 0.005ml 10M NaOH. The mixture was incubated at 25'*C for 5 min. 
Denatured DNA was precipitated by the addition of 0.003ml 3M sodium acetate (pH 5.2) and 0.075ml anhydrous ethanol. 
The precipitated DNA was washed twice with 80% ethanol, once with 100% ethanol and dried. The dried pellet was 
dissolved In: 0.005ml 2Q\iM a-1 . 0.005ml 20fiM a-2, 0.010ml 10X Taq polymerase buffer (Perkin Elmer-Cetus), 0.0005ml 
TaqI polymerase (Perkin Elmer-Cetus), 0.663ml H2O, 0.016ml of all four deoxyrlbonucleotldetriphosphates (1.25mM 
each). TaqI polymerase was added after heating the solution to 94°C for 1 min. After addition of the enzyme the aqueous 
solution was overlayed with 0 100ml paraffin oil. The reaction-mixture was cycled, by hand, 25 times at the following 
temperatures: 37''C for 2 min, 68.5°C for 3min and 94°C for 1 min. After the twenty-fifth cycle, the reaction mix was 
Incubated at 37°C for 2 min and 68.5°C for 1 0 min. A portion (0.005ml) of tiie reaction mixture was analyzed by agarose 
gel electrophoresis and revealed a band of the expected size (~ 430bp)- The PCR-amplifled DNA fragment should 
contain a 5* extension, that Includes an ATG codon embedded in an optimal sequence context for initiation of translation 
(M. Kozack 1986, Cell 44:283-292). The 3' end should contain an extension that includes a translational terminator and 
a Sail restriction endonuclease site The PCR-amplrfication reaction was phenol-chloroform extracted and ethanol pre- 
cipitated. The dry pellet was dissolved In 0.05ml 1 mM TrIs-HCI (pH 7.8). A portion of this material (0 025ml. 1 .25ug) was 
digested with Ncol and Sail and purified by acrylamide gel (5%) electrophoresis. A gel slice containing the fragment 
was eluted by crushing the gel slice m 0.3ml 2.5 ammonium acetate (pH 7.4} euKi incubating at 37*'C for 16hr. Acrylamide 
fragments were removed by centrifugation and the DNA precipitated by the addition of 0.75ml of ethanol. The pellet was 
collected and dissolved in 0.020ml 1 mM Tris HCI (pH 7.8) 0. 1 mM EDTA. This fragment was cloned into Ncol, SI digested 
and agarose gel purified pGS3888 Fifty nanograms of Ncol. Sail digested pGS3888 was incubated (2hr, 25°C) witti 
50ng of gel purified, PCR-amplified Ncol-Sall fragment containing the a-globin cDNA in 0.01ml ligase buffer with 10 
units of T4 DNA ligase. A portion of this reaction mixture was used to transform E.coli DHSa- and ampicillin resistant 
clones were selected on LB-ampicillln medium. Plasmid DNA was prepared from 12 independent isolates and digested 
with Ncol and Sail. The restriction digests were analyzed by acrylamide gel electrophoresis (5%), all twelve contained 
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a fragment of the expected size. One of these was designated pGS4088 (Figure 22). The a-glcbin insert in pGS4088 
was completely sequenced to assure that no mutations had been introduced by PCR-amplification. 

CONSTRUCTION OF A YEAST EXPRESSION PLASMID THAT CO-EXPRESSES a- AND B-GLOBIN GENES FROM 
A SINGLE PLA^MID 

The a-gfobin and p-globin expression cassettes from the plasmids pGS3888 and pGS4088 were cloned into a single 
plasmid in a way that allows them to be excised on a single NotI fragment. This Not! fragment was then cloned into the 
high copy yeast plasmid pC1N to generate a plasmid carrying and expressing both a-and p-globin chains under the 
control of separate (though identical) promoters. The details are presented below. 

INTRODUCTION OF NOTI SITE INTO DSKf+) 

The plasmid pSK(+) (Stratagene, Inc.) (Figure 23(a)) was modified by digesting 100ng of purified plasmid DNA with 
KpnI. After digestion was complete, the DNA was ethanol precipitated and the dry pellet dissolved in 0.05ml T4 DNA 
polymerase buffer containing 10 units of T4 DNA polymerase. The reaction mixture was incubated at 37^0 for 20 min, 
NaaEDTA (1 OmM) was added and the sample heated to 70'*G for 1 0 min. The digested DNA was precipitated with ethanol 
and the dry pellet dissolved in 0.01 ml of lOmM Tris HOI (7.8) 1 mM EDTA. A portion of this material (20ug) was dissolved 
In 0.005ml of ligase buffer containing 10 units of T4 DNA ligase and SOng of phosphor ylated NotI linkers. The ligation 
mixture was incubated at 25°C for 2 hr and a portion used to transform E.coli DH5a. Ampicillin resistant colonies were 
selected on LB-ampicillin medium. Plasmid DNA was isolated, digested with NotI and analyzed by acrylamide gel (5%) 
electrophoresis. A plasmid containing the additional Not! site is expected to generate a new, ~ 90bp fragment. Such a 
plasmid was identified and designated pSN(+) (Figure 23(a)). 

CLONING THE pGGAP-a-GLQBIN EXPRESSION CASSETTE INTO pSN(+) 

pSN(+) was digested with Sail, phenol -chloroform extracted and ethanol precipitated. The precipitated DNA was 
dissolved (SOug/ml) in 1 mM Tris-HCI (pH 7.8), 0. 1 mM EDTA. pGS4088 was digested with Xhol and the ^ 1 1 0Obp fragment 
containing the a-globin expression cassette was Isolated from an agarose gel. The purified fragment was dissolved 
(20ug/ml) in 0.1 XTE. Twenty-five nanograms of Sail digested pSN(+) was mixed with SOng of the gel-purrfled a-globin 
fragment in 0.01 ml ligase buffer containing 10 unite of T4 DNA ligase. The ligation mixture was Incubated for 1 .5 hr at 
25^*0 and a portion used to transform E.coli DHSo. Ampicillin resistant clones were isolated. One hundred transfer mante 
were transferred, in a grid pattern, to fresh plates. Replicas of the grid were made on a nitrocellulose filter and prepared 
for colony hybridization (R.W. Davis et al. Adv Bacterial Genetics . Cold Springs Harbor, NY, 1980.). The hybridization 
probe was ollgonudeotide 4 (Tables). This oligonucleotide (20pM) was labelled with 32P-atp ina 0.02ml reaction mixture 
containing 0.050M Tris-HCI (pH 7.6), 0.01 M MgCl2. 0 005M DTT. 0.0001M spermidine. 0.0001M EDTA, 50pM of 32p. 
ATP and 10 units of T4 polynucleotide kinase. The reaction mixture was incubated for 2hr at 37°C, unincorporated ATP 
was removed by spin-column chromatography (BIORAD) used according to the manufactures instructions. The filters 
were incubated (37"'C) in hybridization solution {6XSSC, 50% formamide. 2% SDS, 20pMoles of 32p labelled probe) for 
1 6 hr. The filters were washed 4 times ( 1 5 min each) with 1 X SSC. 0. 1 % SDS at SS^'C; twice (5 min each) with 0. 2XSSC. 
0.1% SDS at 50'C and twice (5 min each) with 0.2XSSC. Autoradiographs were made of the filters. Colonies that pro- 
duced a hybridization signal were used to prepare plasmid DNA. This DNA was digested with EcoRI and the fragments 
produced by this digest were analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the correct 
size was identified and designated pGS4888 (Figure 23(a)). 

CLONING THE pGGAP-p-GLOBIN EXPRESSION CASSETTE INTO pGS4888 

The construction of pGS4888 required cloning an Xhol fragment (pGGAP -a-globin) into a Sail site. The combination 
of a Xhol site with a Sail site destroys the recognition sites for both Xhol and Sail restriction endonuci eases, leaving 
pGS4888 with a single unique Xhol site. An Xhol fragment from pGS3888 was purified by agarose gel electrophoresis 
and cloned Into Xhol digested pGS4888 essentially as described for the construction of pGS4888. Colony filter hybrid- 
izations were performed using the Ncol-Sall fragment (containing p-globin cDNA) from pGS3888 as a hyt)ndization 
probe. This fragment was purified by agarose gel electrophoresis and radioactively labelled with a32P-dCTP using an 
Amersham random oligonucleotide labelling kit. Filters were hybridized, washed and autoradiographs made as described 
for the construction of pGS4888. Plasmid DNA was prepared from colonies producing a hybridization signal and digested 
with either Ncol or Ncol and Sphl. TTiese digeste allow Identification of plasmids that contain both the a- and p-globin 
expression cassettes and reveals the relative orientation of the a- and p-globin transcriptional units. A plasmid containing 
both transcriptional units was identified and given the designation pGSl 89 (Figure 23(b)). 
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CONSTRUCTION OF PLASMID pC1 N 

The plasmid pCIU was modified to introduce a NotI site. pCl U DNA (100ug) was digested with Sail, phenol-chlo- 
roform extracted and ethanol precipitated. The Sail ends were modified with T4 DNA polymerase to produce "blunt" 

5 ends and phosphor ylated NotI linkers were added The procedures used for these modifications are essentially the same 
as those used to produce pSN(+) (desaibed above). The plasmid resulting from these manipulations is called pCIN 
(Figure 23(b)). This pCIN was digested with Not1 . phenol-chloroform extracted and ethanol precipitated. A ~ 2.4kb NotI 
fragment (carrying a- and p-globin expression cassettes) was purified from NotI digested pGSI 89 DNA by agarose gel 
electrophoresis. Fifty nanograms of NotI digested pC1 N was mixed with 50ng of the gel purified, 2.4 kb fragment isolated 

10 from pGSl89 in 0.01 ml of ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 
25'*C for 2hr and a portion used to transform E.coli DH5a. Ampicillin resistant clones were selected on LB-ampicillin 
plates and plasmid DNA prepared. The purified DNA was digested with EcoRI and analyzed by agarose gel electro- 
phoresis to identify plasmids carrying the a- and p-globin genes and to determine the orientation of the insert with respect 
to the vector. Two plasmids were identified, representing the two possible orientations and have been designated pGS289 

J5 (Figure 23(b)) and pGS389 (Figure 23(b)). 

EXPRESSION OF RECOMBINANT HUMAN HEMOGLOBIN IN SACCHAROMYCES CEREVISIAE 

S. cerevisiae strains GSY112 (MATapep4::HlS3prb1 1.6R his3 200 ura3-52 leu2;:hisG cani cir^) and RSY334 
20 (MATa- regl -501 pep4-3 prbl -1 122 ura3-52 leu2-3, Ieu2-1 12) were transformed with plasmids pGS289 or pGS389 by 
the method of Ito et al. (J. Bacteriology 153:163-168 (1983)). Transformants were selected on yeast SD-ura. Single 
colony isolates were picked and streaked to SD medium lacking uracil and leucine. Colonies from this selective medium 
(SD-ura.-leu) were used to inoculate 2ml of SD-uraHeu, the cultures were incubated at 30'C for 24hr and used to 
inoculate 20ml cultures of VP -i- 3% ethanol, these were incubated for an additional 24hr and galactose was added to a 
25 concentration of 2%. Samples were removed at 4, 8, 24. 28, 32 and 48 hours post induction. Cells were collected by 
centrifugation and washed with 1 ml 10mM Tris HCI (pH 7.8), 1 mM EDTA and resuspended in SDS-PAGE sample buffer 
(2 X 108 cells/ml). Samples were boiled for 10 min and the Insoluble material removed by centrifugation, 0.005ml samples 
were analyzed by SDS-PAGE (12.5% gel) followed by transfer to nitrocellulose, a- and p-globin chains were stained 
using commercially available rabbit anti-human hemoglobin and an immunoblotting kit purchased from Promega. The 
30 protocols used were supplied by Promega. The immunobiot indicated that both chains are synthesized and that pGS289 
produces slightly less material than pGS389. An apparent lack of stoichiometry between the a-chain (lower band) and 
p-chain is due to a difference in immunoreactivity of the antibody to the two chains. This was demonstrated by comparing 
Commassie brilliant blue stained gels of purified human and recombinant hemoglobin with immunobtotted samples 

35 Example 20: Characterization Of Human Recombinant Hemoglobin Synthesized in Yeast 

RSY334[pGS389] was grown In 1 00ml SD-leuclne to an ODeoo of 2 4. This was used to inoculate 1 L of YPE. This 
culture was shaken at 30*C, for 24hr at which time 50ml of 40% galactose was added. The incubation was continued 
and the cells harvested 24 hr later (ODqoo 9). The cell pellet was resuspended in 100ml of 0.01 M Tris-HCI (pH 7.1 8), 

40 0.001 M EDTA and carbon monoxide bubbled through the cell suspension for 3 min. The cells were collected by centrif- 
ugation and after the CO treatment were distinctly red. The eel! pellet (19 gm) was resuspended in 19ml of lysis txrffer 
(0.01 M NaP04 pH 6.0, 0.020M DTT. 1% Triton X 100. 0.001 M PMSF, 0.005M benzamidine and O.OSmM leupeptin) and 
bubbled with carbon monoxide for 2 min. The cell suspension was sonicated with Branson 250 sonicator equipped with 
a 0.5 inch disrupter horn. The sonication time was 2 min (0.5 sec pulses) at full power. This was repeated 4 times with 

45 2 min cool-down periods between sonication intervals. The cell debris was removed by centrifugation (27,000 x g for 1 5 
min) and the supernatant saved (0°C). The pellet was resuspended in 5ml of lysis buffer. The resuspended pellet was 
sonicated with the microtip as described above (70% of maximum output) The cell debris was removed by centrifugation 
as described above and this supernatant combined with the first. The combined solutions were clarified by centrifugation 
(38,000 X g, 20 min). producing a clear, red solution. This was loaded (after adjusting the pH to 6.0. with 1 0mM phosphoric 

50 acid) onto a 5ml S-Sepharose fast flow column equilibrated with 0.01 M sodium phosphate (pH 6.0). All of the red material 
bound to the colunn and the column was washed with 20ml of 0.01 M sodium phosphate (pH 6.0). The column was 
eluted with 0.05M sodium phosphate (pH 7.5), 0.1M NaCI and 1 ml fractions were collected. The red color eiuted in two 
fractions. The purity of this material was analyzed by SDS-PAGE and appears to be ^ 50% pure after the first chroma- 
tography step. This material was dialyzed against lOmM sodium phosphate (pH 6.0), 0.00 1M EDTA for 16 hr at O'C and 

56 re-chromatographed on a mono-S FPLC column (ph 6.8 - pH 9.0, 0.02M sodium phosphate gradient). The peak fraction 
from this is s 85% pure. An absorbtion spectrum was obtained by scanning the mono-S purified material from 400mM 
to 650mM with a Shimadzu spectrophotometer (Figure 24, top). The spectrum obtained was identical to that of human 
hemoglobin (Figure 24, bottom), indicating that the protein had foWed and incorporated heme. The amount of hemoglobin 
recovered in the two peak fractions was determined from the extinction coefficient (1 .23 x 10^ A450/M) to be ~ 20mg. 
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Example 21 : Construction of Vectors tor the Expression of a- and 6-Globin from Separate Yeast Plasmids 



In addition to the development of a single yeast vector that carries both a- and p-^lobin expression cassettes we 
also developed a system that uses separate plasmids for each of the two globin cDNA's. The two plasmids each carry 
two yeast genes that are used to maintain the plasmid in yeast. Both have the LEU2 gene in common and one (pGS4688) 
has the URA3 gene, the other (pGS4988) has the TRP1 gene. By using a host that carries mutations in URA3. LEU2 
and TRP1 both plasmids (one with a-globin and the other with the p-globin expression cassette) can be maintained. 
The constructions of these vectors is described below. 

C ONS TR UCTIO N QF p CIT 

The plasmkj pClU (5ug) was digested with BamHI and Sail and the largest fragment was purified by agarose gel 
electrophoresis. The plasmid YRp7 (lOug) (J. Strathern. E, Jones, and J. Broach, The Molecular Biology of the Yeast 
Saccharomyces . Cold Spring Hartx)r, NY. 1 981) was digested with Bglll and Sail and the fragment containing the TRPI 
gene was purified by agarose gel electrophoresis. Twenty-five ng of gel-purified, BamHI and Sail digested pClU was 
mixed with 50ng of the Bglll-Sall fragment in ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture 
was incubated at 25°C for 1 ,5 hr and a portion of the ligation reaction mixture used to transform E.coli DH5a. Tetracycline 
resistant colonies were selected on LB-tetracycline medium. Plasmid DNA was prepared from 1 5 transformants, digested 
with EcoR1 and analyzed by agarose gel electrophoresis. One isolate with the expected EcoR1 restriction fragments 
was chosen and was designated pCIT (Figure 22(a)). 

CLONING THE p^GLOBIN EXPRESSION CASSETTE IN pCIT 

One hundred nanograms of pCIT was digested with Sail, phenol extracted, ethand precipitated and dissolved in 
0.01ml of lOmM Tris-HCI (pH 7.8). ImM EDTA. The plasmid pGS3888 was digested with Xhol and the - 1.2 kb Xhol 
fragment containing the p-globin expression cassette was purified by agarose gel electrophoresis. Ten nanograms of 
Sail digested pCIT was mixed with 60ng of the Xhol fragment containing the p-globin expression cassette in 0,01ml of 
ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 25°C for 30 min. A portion of 
this material was used to transform E.coli DHSo. One hundred ampidllin resistant transformants were picked and patched 
to LB-ampidllin agar. They were incubated for 5 hr at 37*C and cverlayed with a nitrocellulose filter. Plasmids containing 
the Xhol fragment were identified by colony hybridization, using the Xhot fragment as a hybridization probe, as described 
above. Colonies producing an autoradiographic signal were used to prepare plasmid DNA. The purified DNA was 
digested with EcoRI and analyzed by agarose gel electrophoresis for the presence of the expected restriction fragments. 
Two clones containing the desired Xhol insert were identified and the orientation of the insert was determined by agarose 
gel analysis of EcoR1 digests or SphI digest of the plasmid DNA. Both isolates contained the desired insert in the same 
orientation and one was designated pGS4988 (Figure 22(a)). 

CLONING g-GLOBIN EXPRESSION CASSETTE IN pClU 

The Xhol fragment from pGS4088 containing the a-globin expression cassette was purified by agarose gel elec- 
trophoresis and the recovered fragment dissolved (100ng/0.01 ml) in ImM Tris-HCI ph 7.8, 0.1 mM EDTA. Twenty-five 
nanograms of the gel purified fragment was mixed with lOng of Sail digested (phenol-chloroform extracted, ethanol 
precipitated) pClU in 0.01 ml ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incut>ated at 
25*C for 1 . 5 hr and a portion used to transform E.coli DH5a. Amplcillin resistant colonies were selected on LB-ampicillin 
plates Plasmid DNA was prepared from 12 transformants, digested with Hindlll and analyzed by agarose gel electro- 
phoresis. The two possible orientations were isolated and given the designation pGS4488 (Figure 22(b)) and pGS4688 
(Figure 22(b)). 

CONSTRUCTION OF DIPLOID STRAINS EXPRESSING a- AND p-GLOBIN CHAINS FROM SEPARATE PLASMIDS 

S. cerevisiae RSY330 (MAa pep4-3 prb1-1 12 hist7 ura3-52 trpl-289 cani gall) was transformed (Ito et al.) with 
pGS4488 or pGS468d. Trp^ transformants were selected on SD-trp medium and streaked for single colonies. S.cerevi- 
siae BJY1991 (MATa pftjM 1 2 pep4-3 Ieu2-3.1 12 trpi- 101 ura3-52 gal2 can1) was transformed with pGS4988. URA* 
transformants were selected on SD-ura medium and streaked for single colonies. These strains were each tested for 
the production of a- and p-globin as follows: (1 ) Single colonies were picked from SD-selective medium and used to 
inoculate 2ml of SD-selective (liquid) medium. The cultures were incubated for 24 hr at SO'^C and diluted into 25ml of 
fresh SD-selective medium and incubated for an additional 24 hr. The cells were collected by centrifugation and resus- 
pended in 25ml of YP-galactose (2%) and the incubation continued for an additional 24 hr. The cells were harvested 
(8,000 x g, 10 min) and the pellet washed with 50ml 0.010M TrisHCI(pH 7.8). 0.001 M EDTA. The pellets were dissolved 
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by heating to 96°C for 10 min in SDS-PAGE sample buffer and tfie debris removed by centrifugation (15,000 x g, 10 
min). The cleared supernates from 1x10^ cells each were analyzed by SDS-polyacrylamide gel electrophoresis and 
Western immunoblotting as described above, p-globin was readily detectable in extracts from BJY3505[pGS4988]. We 
were also able to detect a-globin cross-reacting material, although the signal strength was considerably weaker. The 

5 production of tetromeric hemoglobin requires the presence of both a and p chains, ideally these would be expressed in 
the same cell. Because strains BJY3505, BJY1991 and RSY330 are haploids they each can be mated with a yeast 
strain of the opposite mating type. Strains RSY330 and BJY1991 are both mating type a. whereas BJY3505 is mating 
type a. BJY3505{pGS4988], RSY330[4688] or BJY1 991 [4688] or RSY330[pGS4688) matings were done and diploids 
selected by streaking onto SD minimal medium with no additional amino acids or other nutrients. Neither of the plasmid- 

w bearing parental strains are capable of growth on this medium, diploids, however, can grow. Because the diploids are 
homozygous for the mutations in TRP1 and URA3 tx)th plasmids must be present for the cells to grow in the absence 
of these nutrients. These diploid strains were analyzed for the synthesis of a and p-globin as described above. A most 
surprising result was obtained. Although a-globin and p-globin are synthesized at low levels in the haploid strains, co- 
expression in a diploid strain results in a substantial increase in the levels of both chains. Furthermore, after induction 

J5 (24 hr) with galactose the cell pellets develop a distinct, pink-red color. These results suggest that: (a) co-expression of 
a-and p-globin stabilizes the two proteins, perhaps as a consequence of their interaction and (b) the protein is apparently 
folding and incorporating heme. 

Example 22: Oxvoen Binding Properties of Yeast-Derived Hemoolobin 

20 

The P50 of the yeast derived hemoglobin is nearly identical to that of Hgb Ao. Depending on how we measure it, 
the P50 ranges from about 5 to 10 torr in a phosphate free solution. At 25°C it is about 4-6 torr in 50mM BisTris, pH 7.4, 
NaCI 0.1 M. In the same solution, at 37<*C, the P50 is from 8.5 to 11 . 

25 Example 23: Expression of Di-Alpha Hemoglobin in S.Cerevisiae 

Methods 

Unless stated othenA/ise, all enzymes (restriction endonucleases. 14 DNA ligase, T4 DNA polymerase, T4 polynu- 

30 cleotide kinase) were purchased from New England Biolabs, Pharmacia, BRL, Stratagene or Boerhinger Mannheim. 
Restriction enzynnes and T4 DNA iigase were used with the buffers supplied by the manufacturers. 

Ethanol precipitation of nucleic acids was carried out by the addition of 0.5 volumes of 7.5M ammonium acetate 
and 2 volumes of 20:1 ethanol-isopropanol. The pellet was collected by centrifugation at 14.000 x g for 15 minutes, 
washed twice with 80% ethanol, once with 95% ethanol and dried in vacuo. Phenol extractions were done by the addition 

35 of 50:49:1 mixture of phenol :chloroform:isoamyl alcohol. Phases were separated by centrifugation at 14,000 x g and 
the aqueous phase collected. Plasmid DNA was purified from E.coli DH5a as described by Birnborn and Doly (Nucleic 
Acids Research 1979, 7:1513-1520). Electrophoretic analysis of DNA was carried out in agarose gels using tris-acetate 
electrophoresis buffer (Maniatis, et al. Molecular Cloning . Cold Springs Hart>or, NY. 1982). DNA was visualized by stain- 
ing the gels with 0.5^g/ml ethidium bromide and exposing the gels to ultraviolet light. DNA fragments were purified from 

40 agarose gels using a kit purchased from BIO- 1 01 . DNA fragments were purified from acrylamide gels by crushing the 
excised gel fragment in 3.25M ammonium acetate and Incubating overnight at 37*'C. Gel fragments are removed by 
centrifugation (14,000 x g, 15min) and the DNA precipitated with ethanol. The precipitate is dissolved in TE (lOmM Tris 
HCI. pH7.8, 1 mM Nas EDTA). Acrylamide gel electrophoresis of DNA was done as described by Maniatis, et al. (Molec- 
ular Cloning. Cold Spring Harbor Laboratory, NY, 1982). Bacteriological growth media and DNA transformation methods 

45 are as desaibed by R.W. Davis, etal. (Advanced Bacterial Genetics. Cold Spring Hart>or Laboratory. NY. 1 980). Media 
for the growth of S.cerevisiae has been described by F. Sherman M aL (Methods in Yeast Genetics: A Laboratory Manual. 
Cold Spring Harbor Laboratory, NY, 1 979). Transformation of S.cerevisiae with linear or circular DNA was carried out as 
described by H. Ito, etaL (J. Bacteriology. 153:163-168(1983)). 

Removal of the PstI and Spel sites from pGS4888 . The design of the synthetic linker for joining two a-globin chains 

so allows the inclusion of PstI and Spel sites flanking a 30bp sequence that includes the junction of the two a-globin coding 
sequences. Because we anticipate testing several different linker sequences, these sites will allow directional cloning 
of relatively short synthetic oligonucleotides encoding different linker sequences. Removal of the PstI and Spel sites 
from the vector sequence is. therefore, necessary so that the sites in the coding region are usable. One jug of the plasmid 
pGS4888 was digested with PstI and ethanol precipitated. The dry pellet was resuspended in 50^ of 33mM Tris-acetate. 

55 pH7.9. 66nnM potassium acetate, 10mM magnesium acetate, O.SmM DTT and 50 fiM of each dNTP (T4 polymerase 
buffer). Two units of T4 DNA polymerase were added and the reaction mixture incubated for ISmin at 37'C. Naa EDTA 
was added to 12.5mM and the reaction mixture heated to 65°C for ISmin, phenol extracted and ethanol precipitated. 
The dry pellet was dissolved in 1 4 ^1 of T4 DNA ligase buffer (BRL) and 1^1 (10 units) of DNA ligase added. The ligation 
mixture was incubated a 4<'C for 16hr. A portion of the ligation reactfon was used to transform E.coli DH5a and trans- 
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formants were selected on LB-ampicillin plates. Plasmid DNA was prepared from 1 2 transformants. The DNA was ana- 
lyzed by agarose gel electrophoresis of PstI digests. Five transformants had lost the PstI site and one of these was 
designated pGS1889. The Spel site of this plasmid was removed as described above after digestion of pGSl889 with 
Spel. A plasmid was identified that had lost both the PstI and the Spel site and was designated pGS1989. 

5 Strategy for Joining Two Copies of the a.-globin cDNA. A fragment containing the 5'-363bp of the a-globin coding region 
can be excised from pGS4888 as an Ncol to ApaL1 fragment. A second fragment containing nucleotides 109 through 
the 3' untranslated region can be removed as a Fokl to Sal! fragment. These two fragments can then be joined together 
to create a single translation unit encoding two tandem copies the of a-chain cDNA by using a synthetic oligonucleotide 
adaptor (Table 8). The purification and assembly of these fragments is described below. 

10 The plasmid pGS4888 was sequentially digested with ApaLI and Ncol and a ~365bp fragment (a-chain 1) purified 
by acrylamide gel electrophoresis. A second fragment of ~351bp (a-chain 2) was prepared by sequential digest with 
Fokl and Sail followed by acrylamide gel electrophoresis. Four oligonucleotides were synthesized (Table 8) and assem- 
bled into a ~173bp linker (RGGV) containing an ApaLI and Fokl ends. Synthesis, purification and assembly of oligonu- 
cleotides was as described previously This adaptor encodes an amino acid bridge linking the carboxy-ter minus of a- 

15 chain 1 with the ami no-terminus of a-chain2 as well as portions of the 3' end of a-chain 1 and the 5' end of a-chain2. The 
carboxy terminal arginine residue of a-chain1 Is separated from the amino terminal valine of a-chain2 in this construct 
by 2 glycine residues. The "diglycine bndge" portion is flanked by Hpal, Spel and PstI sites. These sites allow the sub- 
stitution of a variety of bridges by the use of ~30bp adaptors to connect the two a-chains into an expression cassette 
was carried out in a four part ligation as described below. 

20 Plasmid pGS1989 was sequentially digested with Ncol and Sail and the large fragment containing the plasmid 
vector and pGALGAP was purified by agarose gel electrophoresis (gp1989). Fifty nanograms of gp1989 were mixed 
with 200ng each of the gel purified ApaLI -Ncol a-chain1 fragment and the Fokl-Sall a-chain2 fragment. A twenty-fold 
molar excess of the synthetic ApaLI -Fokl adaptor was added (the 5'-ends of the adaptor segment are not phosphor- 
ylated). The ligation reaction was carried out in a volume of 20^il for 4hr at 23°C. A portion of this reaction mixture was 

25 used to transform E. coli DH5a and ampicillin resistant colonies were selected. Transformants were patched to nitrocel- 
lulose filters and screened by hybridization with the 32p-iabelled otigonucleotide AL2-as (Table 8). Five pMoles of oligo 
AL2-as were incubated m 20^1 of a solution containing 2 units of T4 polynucleotide kinase, 50mM Tris-HCI (pH7.6), 
lOmM MgCl2.6mM DTT, 0.1 mM spermidine, 0.1 mM EDTA. and lOpMoles of y^P-ATP (7000 Ci/mf^ole) for 2hr at Sy^C. 
Filters were processed as described by Maniatis. et al. and the hybridization was done at 37»C in 5XSSC. 50% forma- 

30 mide, 1 0O^g/ml yeast tRNA and 2% SDS for 1 6hr. The filters were washed, sequentially, in 2XSSC and 1 XSSC at SS'C 
for ISmIn (twice in each, all wash solutions contained 1% SDS). Dried filters were exposed to X-ray film and colonies 
giving a hybridization signal were used to prepare plasmid DNA. Plasmid DNA was analyzed by restriction enzyme 
digestion to identify plasmids that contained inserts of a size consistent with two a-chains (Ncol-Sali digest) and that 
contained unique PstI, Spel and Hpal sites. A plasmid identified in this manner was designated pGS2189 (Figure 25). 

35 An Xhol fragment from pGS3888 containing the p-globin expression cassette was purified by agarose gel electro- 
phoresis. Xhol digested pGS2189 (50ng) was combined with 150ng of the gel purified insert from pGS3888 in Ipil of 
ligation buffer containing 10 units of T4 DNA ligase. A portion of this mixture was used to transform E.coli DH5a and 
ampicillin resistant colonies were selected. Plasmid DNA was isolated and analyzed by digestion with Xhol, BamHI or 
Ncol. Several plasmids were identified that produced restriction fragments of the expected sizes, all contained inserts 

40 in the orientation shown in Figure 25. One of these was designated pGS2989. Although this plasmid contains the linked 
a-globin genes and a p-globin gene under the control of separate promoters, it Is not capable of replication in S.cerevisiae. 
The entire expression cassette, containing the two genes (dia and p-globin), can be purified as a NotI fragment. Ten i^g 
of pGS2989 was digested with Pvul and NotI and the NotI fragment gel purified. The digestion with Pvul was done to 
reduce the size of the vector sequences which othenwise would comgrate with the desired NotI fragment. Two hundred 

45 ng of the gel purified NotI fragment was combrned with 50ng of NotI digested pCl N in 1 0^L of ligation buffer containing 
10 units of T4 DNA ligase. The reaction mixture was incubated at 4°C overnight and a portion was used to transform 
E.coli DH5a. Ampicillin resistant transformants were selected and plasmid DNA prepared. DNA was digested with Ncol- 
Sall, PstI or NotI to identify plasmids with the dia, p-globin expression cassette and to determine the orientation of the 
inserted fragment. Several plasmids were identified that contained the correct insert, all of which have the inserted 

50 fragment in the same orientation. One of these wasdesignated pGS3089 (Figure 25). This plasmid was used to transform 
strains GSY1 12 and RSY334. 

55 S.cerevisiae strains GSY1 1 2[pGS3089] and RSY334[pGS3089] were grown to saturation in SD-uracil medium and 
diluted into 2L of YPD medium (all cultures were incubated at 30''C). Twenty-four hours after inoculation (of the YPD 
culture), galactose was added to 1 % and the cultures incubated for another 24hr. Carbon monoxide was bubbled through 
the culture and the cells collected by centrifugation. A 1 :1 mixture (weight:vol) of cells and breakage buffer (1 OmM sodium 
phosphate (pH7.2). 1mM EDTA, ImM EGTA and 5mM DTT) were disrupted in a "Bead-Beater" (Biospec Products, 
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Bartlesville. OK). Debris was removed by centrlfugation (10,000 x g. 30min). The soluble Iraction was adjusted to pH6.0 
with phosphoric acid and chromatographed on a column of S-sepharose (fastftow) equilibrated in 10mM sodium phos- 
phate pH6.0. The loaded column was washed with 1 0mM TrIs-HCI, pH6.7 and hemoglobin eluted by washing with 20mM 
Tris-HCI, pH7.5. A bright red band was collected and the pH adjusted to 8.0. by the addition of NaOH. This material was 
then chromatographed on Q-sepharose (fast flow) equilibrated in 20mM Tris-HCI. pH8.0. Hemoglobin was eluted with 
a NaCI gradient (0-0.4M). Afinal chromatography step was carried out on Sephacryl S-200 equilibrated in 5mM NaP04. 
pH7.4. 0.1 M NaCI. Each step of the purification protocol was analyzed by SDS-polyacrylamide gel electrophoresis (Lae- 
mmli, U.K., 1970, Nature 227 680-685) and staining with Commassie brilliant tHue. The purified protein contains a band 
that comigrates with monomer p-globin and a band in the expected position for a-globin dimer. This material comigrates 
with human tetrameric hemoglobin when analyzed by size exclusion chromatography (Progel TSK G3000 SWXL HPLC 
column). This protem is red and binds O2 with a 50% binding affinity (Pso) of 8-10 Torr, indicating that it has incorporated 
heme and is capable of reversible O2 binding. 

The purified protein was separated tnto a- and p-chains by reverse phase HPLC and the sequence of the 1 0 amino- 
terminal residues of each chain were determined. The sequence matched that of bona fide human hemoglobin, indicating 
that the initiating methionine had been efficiently removed from both the a-globin dimer and the p-globin dimer. 

Example 24: Construction of Low Affinitv. Genetically Cross-Linked. Hemoolobin Mutants and Expression in Yeast 

Construction of vector for site directed mutaoenesis. The Xhol fragment containing the p-globin gene and its GAL- 
GAP promoter waas isolated from pGS2989 by preparative agarose gelelectrophoresis and ligated with Xhol digested 
Phagescrlpt (RF-form, obtained from Stratagene, Inc.). E. coli XL1 -Blue was transformed with the DNA ligation mixture 
and phage containinginserts were identified (white plaques on medium containing XGAL). Single plaques were isolated 
and DNA was prepared and analyzed by digestion with Xhol and agarose gel electrophoresis. This construct was des- 
ignated Mpp-globin. 

Preparation of single stranded template . A saturated culture of E. coli XL1 -Blue (available commercially from Strat- 
agene, 11 099 North Ton-ey Pines Road, LaJolla, OA 92037) (200 jil) was used to inoculate 4 ml of 2X YT broth. This 
culture was incubated for 2 hr. at 37*'C and then infected with a single M13p-globin phage plaque and the incubation 
continued for 6-8 hrs. Cells were removed by centrif ugation and discarded. Phage were precipitated from 1 .6ml of clarified 
medium by the addition of 320 fill of cold 30% PEG 8000 in 3.5M ammonium acetate followed by incubation on ice for 
30 min. Phage were collected by centrlfugation and resuspended in 0.1 ml of lOmM Tris-HCI pH 8.0. ImM EDTA (TE). 
DNA was isolated by extracting twice with phenol/cholorfbrm. DNA (contained in the aqueous phase) was precipitated 
by the addition of NaCI to 0.5M and two volumes of 95% ethanol. The DNA pellet is dissolved in 20hI of water. 

In vitro mutagenesis reactions . Two hundred ng of template DNA are mixed with a twenty-fold molar excess of the 
appropriate, phosphorylated mutagenic oligonucleotide in 10 pil of 20mM Tris-HCI, pH 7.4, 2mM MgCl2, 50mM NaCI and 
heated to 70 °C for 5 minutes. The annealing reaction is allowed to slowly cool (40 min) to 40°C and then to SO^'C (15 
min). After the last annealing step the mixture is transferred to ice for 5 min. To this mixture 1)i.l of synthesis buffer (4mM 
each dNTR 7.5mM ATP. 175mM Tris-HCI pH 7.4, 37.5mM MgCb 215mM DTT), 0.5 \l\ of T4 gene32 protein (2 ng/^l), 
1 ^il of T4 DNA ligase (3 units), and 1 ml of T4 DNA polymerase (1 unit) are added. The mixture is incubated at 37''C for 
90 min (following 5 min at room temperature). The reaction is stopped by the addition of 90ml of 0.1 M Tris-HCI (pH 8.0) 
and 0. 1 M EDTA. Approximately 0. 1 ^1 of the reaction mix was used to transfect E. coli XL1 -Blue cells (Cells were prepared 
for transformation by the method of D. Hanahan, 1983. J. Mol. Biol. 166:557). 

Screening for phage containino the mutant gene . Approximately 50- 1 00 plaques were picked to fresh plates seeded 
with the appropriate host strain (XL1 -Blue) in ordered arrays. After incubation for 6-8 hrs. at 37°C the plates were overlaid 
with nitrocellulose filters and prepared for hybridization essentially as described in Davis, R.W. et al. (Advanced Bacterial 
Genetics: A f^anual of Genetic Engineering. Cold Spring Harbor Lalxjratory, New York 1 980). The choice of hybridization 
temperature with the mutagenic oligonucleotide was determined on the basis of the nucleotide composition of the oli- 
gonucleotide. The oligonucleotide was labelled with -yazp-ATP and polynucleotide kinase (T Maniatis et al. Molecular 
Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory 1982). Hybridizations were done at 2-5'C below the 
calculated Tm for the correct match In 6XSSC. 2%SDS. lOOng/ml yeast tRNA using 10^ cpm/ml of the labelled oligo- 
nucleotide for >6 hr The Tm was calculated using the formula: 4«C for every GC pair + 2*C tor each AT pair assuming 
an Na-i- concentration of 1M. Filters were washed at the same temperature as the hybridization in 5XSSC, 2% SDS and 
exposed to XRay film. Plaques giving positive hybridization signals were used to prepare single stranded DNA, as 
described above. The single-stranded DNA was used as template for sequencing reactions (Sanger, F. and Coulson, 
A.R. 1975, J. Mol. Biol. 94:441) to confirm that the mutant sequence was indeed present. 

Oligonucleotides used for mutagenesis. 
pN108K.-Presbyterian"5'-AGGCTCCTGGGCAAGGTGCTQGTCTGT-3' 
pE90K-"Agenogi"5'-GCCACACTGAGTAAGCTGCACTGTGAC-3' 
pV67l-(No Alias) 5'-CATGGCAAGAAAATCCTCGGTGCCTTT-3* 
PN1 02T-"Kansas" 5'-GTGG ATCCTGAGACTTTCAGGCTCCTG-3' 
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Cloning into yeast expression vectors . Phage RF DNA was prepared from phage-infected cells that had been 
confirmed to have the mutant sequence and the Xhol fragment containing the altered p-globin gene was purified by 
agarose gel electrophoresis. This fragment was cloned into a derivative of pGS3089 that had been altered to change 
the di-a-linker from the diGly configuration to a single glycine bridge, and from which the p-globin gene had been deleted, 
and designated pGS3889desp (Figure 26). This created a unique Xhol site into which altered p-gtobin expression cas- 
settes could be inserted, allowing coexpression with the a-globin dimer. The expression plasmids generated in this way 
(GS3889 for "Presbyterian", pGS5189 for "Agenogi". pGS5689 for "Kansas" and pGS4989 for "pV67r', all identical to 
pGS3089 except for mutation In specified codon) were used to transform S. cerevislae strains GSY1 12 and RSY334. 

Characteristics of a genetically fused low affinity mutant protein expressed in veast . Cells bearing the plasmid 
pGS3889 (single gly bridge, pN108K alias "Presbyterian") were grown and hemoglobin purified as previously described. 
This material when analyzed for functionality was substantially "right-shifted" compared to the fused protein with a wild 
type p-chain (P5o=23-25 for the mutant with N = 2.5). 

Example 25: Effect of Choice of Strain and Induction Temperature on Expression of Di-Alpha Hemoglobin In E. coli 

Table 100 contains comparisons of di-alphaVbeta fermentations. It also provides comparison of temperatures of 
induction. The column labeled "mg di-A+B" is total mg di-alpha and beta polypeptides per fermentation. The adjacent 
column "mg/OD-L" simply expresses the first column number on a cell density basis. The two columns labeled RHGB 
present total and cell density -corrected output of functional recombinant herTX)globin. The last column shows that in 
terms of final functional hemoglobin recovery strain JM109 is preferable. Without binding ourselves to any theory, we 
believe that this difference has to do with the proteases expressed in different strains. It is interesting to note that of the 
two best JM109 runs, one induction was at 30° and one at 37" with roughly equivalent amounts of final functional Hb 
produced. 

Example 26: Construction of Genetically fused a-Globin Dimers Connected By a Single Glycine or Proline Residue 

Tlie following synthetic adaptors for altering the dia-globin bridge were synthesized and purified as desaibed in 
previous sections. 

TSKYRPVLSPA 
5 '-A ACT AGT AAG TAG AGA OCT GTT TTG TCT CCT GCA-3 RPV 
3»-T TGA TCA TTC ATG TCT GGA CAA AAC AGA GG-5 » 

Hpal PstI 



T S K 
5 '-A ACT AGT AAG 
3'-T TGA TCA TTC 
Hpal 



Y R G V L 
TAC AGA GGT GTT TTG 
ATG TCT CCA CAA AAC 



SPA 
TCT CCT GCA-3* RGV 
AGA GG-5« 

PstI 



Complementary pairs of RPV and RGV oligonucleotides were combined (2.4jjig of each oligonucleotide) in 0.05 
ml of water. The two pairs of adaptors were precipitated (separately) by the addition of 2.^1 of 4M NaCl and 0. 158mL of 
1 00% ethand followed by centrifugation. The pellets were washed with 80% and 1 00% ethanol and dried. The adaptors 
were dissolved in 24^1 of TE. 

Cloning of the RBGV and RPV adaptors . The plasmid of pGS2989 was sequentially digested with the enzymes 
Hpal and PstI and the vector was purified after agarose gel electrophoresis. The digested plasmid was then ligated with 
a 10-fold molar excess of either the RGV or RPV adaptor (these adaptors were not phosphorylated) as previously 
described. A portion of the ligation mixture was used to transform E. coli DH5. Transfbrmants were selected on LB plates 
containing ampiciilin. Clones containing the new adaptor were identified by colony filler hybridization. The hybridization 
probes were either the RPV or RGV upper strands (as shown above). These proves were labelled with "ySaP-ATP and 
T4 polynucleotide kinase. Filters were hybridized with the appropriate probes (10^ cpm/ml) at ST'C in a solution con- 
taining 6XSSC 50% fbrmamide. 2% SDS and 1 50|Lig/ml yeast tRNA. Filters were incubated for >12 hrs. and washed 4 
times with 2XSSC, 2%SDS (250ml, 20min each) and exposed to X-ray film. Colonies that produced autoradiographic 
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signals were used to prepare plasmid DNA which was sequenced using the following primer: 5'-AAGCTTCAGCACCG- 
TATTCA-3' (a^seql). This primer was specifically designed to minimize homology with the corresponding sequence in 
the a1 subunit of the dimer and to maximize homology with sequences near the 5' end of the a2 subunit of the dia dimer. 
This allows sequence to be determined reading from the a region through the sequence that bridges the two a-domains 
without a background sequence reading from a similar sequence near the 5'-end of the a1 -domain. The plasmids con- 
structed in this way were designated PGS2989RPV(sing!e proline linker) or pGS2989RGV (single glycine linker) 

Cloning into veast expression plasmids . The Not! fragments from either pGS2989RPV or pGS2989RGV that contain 
the hemoglobin expression cassette were purified by agarose gel electrophoresis and subcloned into pCiN that had 
been digested with NotI Plasmid DNA from ampicillin resistant transformants was isolated and analyzed by digestion 
with NotI and agarose gel electrophoresis. These plasmids were designated pGS3089RPV or pGS3089RGV. A second 
set of plasmids was generated from these to facilitate the substitution of different p-chain mutants. These plasmids were 
generated by digestion with Xhol and religation under dilute conditions <1|ag/ml). This favors deletion of the p-chain 
expression cassette. Plasmid DNA was isolated from ampicillin resistant transformants and the structures confirmed by 
digestion with NotI and Xhol (these plasmids have been designated pGS3089RGV-desp. see Figure 26. and 
pGS3089RPV-desp). 

Reference Example: Reconstitution of Recombinant Alpha-Globin and Recombinant Beta-Globin with Heme and chem- 
ical Reduction to Yield Artificial Hemoglobin 

Conventional methods of preparing artificial hemoglobin are exemplified by the following procedure. 

The lyophilized recombinant alpha and beta-globins (100 mg each) were individually dissolved in 8M urea/50mM 
Tris-Cl , pH 8.01/1 mM EDTA/ ImM DTT, diluted to a concentration of and incubated at room temperature for 3-4 hours. 
The alpha-globin was then diluted to 0.3mg/ml with chilled 20mM K2HPO4. pH 5.7/1 mM EDTA/ImM DTT. Hemin (25 
mg) was dissolved in 2.4mg 0. 1 M KOH, diluted with an equal volume of 1 M KCN; this solution was then made 0.1mg/ml 
in hemin and 20mM K2HPO4. pH 6.7 with stock phosphate buffer. Hemin from this solution was added to a 2.8 molar 
excess to the chilled alpha-globin; and equal molar amount of beta-globin was added and the solution was dialyzed at 
4''C overnight against 0.1M K2HPO4. pH 7.6/1 mM EDTA/1 mM KCN. The artificial Hb solution was concentrated by ultra- 
filtration using a PM-10 membrane (Amicon) and transferred into a 200ml screw-top test tube with a rubber septum. The 
hemoglobin solution was deoxygenated by evacuation and flushing with N2, and then the solution was saturated with 
CO. lOOmM sodium dithionite solution was prepared anaerobically in a 20ml screw-top test tube with rubber septum. 
4.5 equivalents of dithionite were added to the Hb solution with a syringe, and the mixture incut>at6d on ice for 15 min. 
The Hb solution was gel-filtered against lOmM Na phosphate buffer pH 6.0 on a 4x40 cm Sephadex G-25 (fine) column. 
The colored solution was then applied to a 2x10 cm-52 (Whatman) column equilibrated with the same Ixiffer and the 
chromatography was developed with a linear gradient of 500ml 1 0mM Na phosphate buffer pH 6.0 and 500ml of 70mM 
sodium phosphate buffer pH 6.9. CO was removed from Hb by photolysis under a stream of oxygen. Artificial Hgb 
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prepared this way is isolated in only about 25% yield from the fusion peptides but shows native oxygen binding properties. 
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Tabic 2; Ollaonucleotldc .Sacruencaa Used to Create Dl-Alph;^ 
Globin Genes 

a' (Arg) -Oly-Met (Lau) Linker 

BstBl Eagl 
CGAAATACCGTGGTATGCTGTCTCC 

TTTATGGCACCATACGACAGAGGCCGG 

a' (Arg) -Cly-Gly- (Val) Linker 

BstBl Eagl 

CGAAATACCGTGGTGGTGTTCTGTCTCC 

TTTATGGCACCACCACAAGACAGAGCCCGG 

a' (Arg) -Gly-(Val)a2 Linker 

BstBl Eagl 
CGAAATACCGTGGTGTTCTGTCTCC 

TTTATGGCACCACAAGACAGAGGCCGG 

a< (Arg)-Gly-Gly-(Leu)a2 

BstBl Eagl 

CGAAATACCGTGGTGGTCTGTCTCC 

TTTATGGCACCACCAGACAGAGGCCGG 

(Arg)-(Val)a2 

BstBl Eagl 
CGAAATACCGTGTTCTGTCTCC 

TTTATGGCACAAGACAGAGGCCGG 
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Tablg 4 Pio valugg ggr Herngqlgbin-IiiKc Prqtging 

Hemoglobin 2sj^ 

dos-val Hgb 10.2 
dialpha (arg-gly-inet) Hgb 9.0 
dialpha 

beta^'val — >ile Hgb 16.0 

dialpha 

beta*' val— >ile/ 

beta«2iys — >arg Hgb 16.0 

dialpha (arg-gly-gly-val) Hgb 16.0 



Table S. Synthetic Oligonucleotides Used For The Synthesis of 
pGAP 



1 TCGACTGAAA AAAAAGGTTT AAACCAGTTC CCTGAAATTA TTCCCCTACT 
TGACTAATAA GTATATAAAG 

2 CAATACCTAC CGTTTATATA CTTA TTAGTC AAGTAGGGGA ATAATTTCAG 
GGAACTGGTT TAAACCTTTT TTTTCAC 

3 ACGGTAGGTA TTGATTGTAA TTCTGTAAAT CTATTTCTTA AACTTCTTGA 
ATTCTACTTT TATAGTTAGT CTTTTTTTTA GTTTT 

4 AAGTTCTTGG TGTTTTAAAA CTAAAAAAAA GACTAACTAT AAAAGTAGAA 
AGAAGTTTAA GAAATAGATT TACAGAATTA CAAT 

5 AAAACACCAA GAACTTACTT TCCAATAAAC ACACATAAAT AAACCATGGT 
TAACT 

6 CTAGAGTTAA CCATGGTTTA TTTATGTGTG TTTATTCGAA ACT 



Table 6. Synthetic Oligonucl eotides Used For The Synthesis 
Of The Galactose Upstream Activator 

1 CGTACGGATTAGAAGCCGCCCAGCGGGTGACAGCCCTCCGAAGGAAGACTC 
TCCTCCCTGCGTCCTCCTC TTCACCGGTCGC 

2 AGGACGCACGGAGGAGAGTCTTCCTTCGGAGGGCTGTCACCCGCTCGGGG 
CTTCTAATCCGTACGCATG 

3 GTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAAT 
AAAGATTCTACAATACTACCTTTT ATGGTTATGAAGAGGAAAAT 

4 ATAACCATAAAAGCTAGTATTGTAGAATCTTTATTGTTCGGAGCAGTGCG 
GCGCGAGGCACATCTGCGTT TCAGGAACGCGACCGGTGAAGAC 

5 TGGCAGTAACCTGGCCCCACAAACCTCAAATGAACGAAATCAAATTA 
ACAACCAGATATC 

6 TCGAGATATCTGCTTGTTAATTTGATTCGTTCATTTGAGGTTTGTGG 
GCCCAGGTTACTGCCAATTTTCCTCTTC 



56 



EP0700 997 A1 



Table 7 


Codon Preferences in Yeast 


Ata 


GCU. GCC 


Ser 


UCU. UCC 


Thr 


ACU, ACC 


Val 


GUU GUC 


lie 


AUU, AUG 




GAG 


Phe 


UUC 


Tvr 
lyr 


UAC 


Cys 


UGU 




AAC 


His 


GAG 


Arg 


AGA 


Glu 


GAA 


Leu 


UUG 


Lys 


AAG 


Gly 


GGU 


Gin 


CAA 


Pro 


GCA 


Met 


AUG (No alternative codons) 


Trp 


UGG (ditto) 


Source: Bennetzen arxl Hall, J. Biol. Chem., 257:3026-31 (1982). 



OLIGOhfUCLEOTIDES USED FOR THE CONSTRUCTION OF A GENE 
ENCODING A TANDEM ALPHA-GLOBIN DIMER. 

AL-ISS 

5 • -tgcacgcttctttggacaagttcttggcttctgtttctactgtgttaactagtaagt 
acagaggtggtgttttgtctcctgcagacaagactaac-3 * 

AL-25S 

5*-gttaaggctgcttggggtaaggttggtgctcacgctggtgaatacggtgctgaagcttt 
ggaaaggatgttcttgtct-3 * 

AL-IAS 

5 ' -tgcaggagacaaaacaccacctctgtacttactagttaacacagtagaaacagaagcca 
agaacttgtccaaagaagcg-3 ' 
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AL-2AS 

5 ' -ggaaagacaagaacatcctttccaaagcttcagcaccgtattcacccagcgtgagcacc 
aaccttaccccaagcagccttaacgttagtcttgtc-3 ' 

Note: NCOI - ALPHA- 1-ApaLl; FoKl-ALPHA-2-Sall ; ApaLI-RGGV-Fokl 

— 2j QjQCafiD Affinity of Recombinant MutanH Hemoglobin 

Generated From FX -Hob and Trvptic Digestion 

Oxygen affinity measured at 37 'C in 50 mM Bis-Tris, 
pH 7.4, 0.1 Jl NaCl on a Hemox-Analyzer . Solutions were 60 uU 
in heme and measured between 13 0 and 1.2 torr oxygen tension. 



HgB A, 9.5 

rHgB A, 9.2 

HgB Providence 10.2 

Hgb Kansas 11.3 

HgB (beta val — >ile) 22.4 



The value for Hgb A^ is of course for free hemoglobin 
in solution. The Pj q of whole blood is much higher. 



Table 10 



Effects of NaCI and Inositol Hexaphosphate on Oxygen Binding to Hemoglobin Aq and Recombinant des-Fx Hgb 




P50 


P50 




0.1 M NaCI 


0 M NaCI 


2.2 mM IHP 


OM 1HP 


HgbA« 


6.6 


2.8 


51.1 


6.6 


des-Fx Hgb 


4.9 


3.9 


5.5 


4.9 



Table 1 1 



Distribution of FX- Alpha. FX-Beta and FX-Hgb in E. coli. 




milligrams of protein per OD-iiter of E. coli 




soluble 


Insoluble 


FX-Alpha 


36 


0 


FX-Beta 


21 


21 


FX-Hgb 


188 


0 
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TABLE 100 



5 



10 



25 



35 



PERM DATA 


RFE 


STRAIN 


INDUCT TEMP 


FINAL OD 


MG Dl-A+B 


MG/OD-L 


MG RHGB 


1 


BL21 


30.0 


6.6 






0.0 


5 


BL21HA 


37.0 


9.3 






0.0 


6 


JM109 


37.0 


7.1 


234.0 


16.5 


5.5 


7 


JM109 


37.0 


11.3 


372.0 


16.5 


25.0 


8 


JM109 


37.0 


6.6 


442.0 


33.5 


49 0 


10 


SCSI 


37.0 


15.2 


754.0 


24.8 


38 0 


11 


JM109 


25.0 


11.5 






26 0 


12 


JM109 


25.0 


9.5 






31 0 


13 


JM109 


30.0 


9.0 






64 0 


17 


JM109 


37.0 


14.4 






51 2 


25 


JM110 


30.0 


20.5 






25 0 


26 


LE392 


37.0 


23.0 


1040.0 


22.6 


4 1 


27 


JM110 


30.0 


27.0 


334.0 


6.2 


47 0 


30 


JM110 


37.0 


23.6 






160 


31 


23722 


30.0 


25.4 






59 0 


35 


W3110 


30.0 


18.6 






130 


38 


AG-1 


30.0 


13.0 






57 0 


49 


DH-1 


30,0 


4.0 






77 


50 


NM554 


30.0 


14.0 






56 0 


51 


NM554 


37.0 


14.5 






180 



40 



45 



50 
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TABLE 200: Bacterial and Yeast Vectors 



DEFINITIONS: ROP, Gene which regulates plasid copy number 

R0P+-LOW copy number 

ROP-"Hlgh copy number 
KRf ampicillin resistance used for plasmid selection 
TH, tetracycline resistance used for plasmid 

selection 

TS, tetracycline sensitive, TR gene not functional 

FX- A, FX- alpha gene 

FX-B, FX-beta gene 

DFX-A, Des-FX alpha globin gene 

DFX-B, Des-FX beta globin gene 

DV-A, Des-Val alpha globin gene 

DV-B, Des-Val beta globin gene 

RV-Di-alpha, Di-alpha gene containing no amino acid 
spacer (R-Arginine; V-Valine) 

RGV-Di-alpha, Di-alpha gene containing single glycine 

(G) linker followed by a valine 
RGM-Di-alpha r Di-alpha gene containing single glycine 

linker followed by a methionine (M) 
RGGV-Di-alpha, Di-alpha gene containing two glycine 
linker 

RGGGV-Di-alpha, Di-alpha gene containing three 
glycine linker 

LACI, gene encoding repressor protein which regulates 
TAG promoter 

LAC+-repressor gene on plasmid 
LAC->no reporessor gene on plasmid 



All bacterial plasmids listed below which contain alpha, di-alpha and/or beta genes also have translational couplers. 
The pPL expression system translationally couples the lambda N protein gene to a globin gene. 

pKK223-3 

Parental plasmid obtained commercially from Pharmacia LKB, 800 Centennial Ave.. P.O. Box 1327 Piscataway, N.J. 
08855-1327; all remaining plasmids derived from this parental plasmid. Has TAG promoter fbllowed by a poly-restrictlon 
site region to facilitate gene insertion. 
AR, TS. ROP+. LAC- 

1. pDL ll-62m 
pKK223-3 containing FX-A 
AR, TS, ROP+. UVC- 

2. pDL ll-10a 

pKK223-2 containing FX-B 
AR. TS. ROP+. LAC- 

3. PDLII-66A 

Parental plasmids are 1 and 2; contains both FX-A and FX-B in single operon 
AR. TS. ROP+. LAC- 

4. pGEM FX-A 

Parental plasmids are 1 and pGem1 which is commerically available from Promega Corporation. 2800 Woods Hollow 
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Rd., Madison, Wl 53711. pGem1 containing FX-A 
AR 

5. pGEM FX-B 

5 Parentals are 2 and pGemI 
pGemI containing FX-B 
AR 

6. pDL ll-83a 

10 Parental is 4; contains DFX-A 

7. pDL l!l-6f 

Parental is 5; contains DFX-B 
AR 

15 

8. pDLII-86c 

Parentals are pKK223-3 and 6 
contains DFX-A 
AR, TS. ROP+. LAC- 

20 

9. pDLIII-13e 
Parentals are 7 and 8 

pKK223-3 containing both DFX-A and DFX-B 
AR. TS. ROP+, LAC- 

25 

10. pDL ll-91f 

Parental is 4 contains DV-A 
AR 

30 11.pDLII-95a 

Parental is 7 contains DV-B 
AR 

12. pDL lll-1a 

35 Parentals are pKK223-3 and 1 0 
contains DV-A 
AR, TS. ROP+, LAC- 

13, pDL III-14C 

40 Parentals are 1 1 and 12 
contains DV-A and DV-B 
AR. TS. ROP+. LAC- 

14 pDL lll-47a 
45 Parental is 1 3 

contains RGM-DIALPHA and DV-B 
AR. TS. ROP+, LAC- 

15. pDL III-82A 
50 Parental is 13 

contains RGGV-DIALPHAand DV-B 
AR. TS. ROP+, LAC- 

16. pDL IV-8a 
55 Parental is 13 

contains RGV-DIALPHA and DV-B 
AR. TS. ROP-h. LAC- 
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17. pDLIV-47b 
Parental is 13 

contains RV-DIALPHA and DV-B 
AR. TS, ROP+. LAC- 

18. pDL IV-66a 
Parental is 1 3 

contains RGGGV-DIALPHA and DV-B 
AR, TS, ROP+. LAC- 

19. pDL IV-3a 
Parental is 15 

ROP gene is inactivated by insertion of a Not I linker into the Pvu II site within the POP gene 
AR, TS. ROP-, LAC- 

20. pDL IV-38a 
Parental is 1 5 

contains the Nagai mutation in DV-B 
AR, TS, ROP+, LAC- 

21. pDLIV-58f 
Parental is 20 

ROP gene inactivated as in 19 
AR. TS, ROP-. LAC- 

22. pDL IV-59a 

Parental is 21 and pBR322 which is commerically available from a number of different suppliers, 
contains a functional TR gene constructed in the following manner: 

The Eco R1 site of pBR322 was changed to a Bam Hi site by insertion of a Bam Hi linker. This permitted removal 
of the 5' end of the TR gene from pBR322 as a Bam HI fragment This fragment was then inserted into the Bam 
HI site located at the junction of the TAG promoter and the inactive TR gene of pKK223-3. Insertion of this fragment 
reactivates the TR gene if the fragment inserts in the proper orientation. Selection of colonies on tetracycline plates 
assures presence of the fragment in the proper orientation. 
AR, TR, ROP-. LAC- 

23. pJR V-83a 
Parental is 1 1 

contains DV-B with the Presbyterian mutation (asn108 to lys). 
AR. TS. ROP+, LAC- 

24. pJR Vl-29a 
Parentalsare 15 and 23 

contains RGGV-DIALPHA and DV-B with Presbyterian mutation 

AR, TS, ROP-, LAC- 

25. pJR Vl-53b 
Parental is 24 

made TR by insertion of BAM HI fragment 
AR. TR, ROP+. LAC- 

26. pJRVI-61a 
Parental is 25 

made ROP- by insertion of Not I tinker into Pvu II site 
AR. TR. ROP-, LAC- 

27. pDL V-4a 
Parentals are 16 and 26 

contains RGV-DIALPHA and DV-B with Presbyterian mutation 
AR. TS. ROP-, LAC- 
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28. pDLV-10a 
Parental is 27 

insertion of Bam H1 fragment to convert to TR 
AR. TR. ROP-. LAC- 

5 

29. pDLV-16d 
Parental is 28 

contains LACI gene inserted into Not I site LACI gene obtained using following protocol: 

Polymerase chain reaction (PGR) primers containing NOT I sites at their 5' ends were used to amplify the lac I gene 
10 Following gel purification the gene was inserted into the NOT I site of pDLV-la AR. TR, ROP-. LAC+ 

Several other plasmids constructs fiave been designed to facilitate the incorporation of a second beta globin gene 
under regulation of its own TAG promoter. 

IS 30. pDL IV-64a 
Parental is 1 4 

contains beta globin under regulation of a synthetic TAG promoter 
AR. TS, ROP+, LAC- 

20 31.pDLIV-67a 

Parental plasmids are 1 4 and 30 

contains D I ALP HA under regulation of one pTAG and DV-B under regulation of a second pTAG 
DV-B IS adjacent to D I ALPHA 
AR. TS, ROP-, LAC- 

25 

32. pJR Vl-54a 

Parental plasmids are 1 4 and 30 

contains D I ALPHA and DV-B under regulation of one pTAC and a second DV-B under regulation of another pTAC 
The second DV-B is inserted into the Pvu II site of the plasmid 
30 AR. TS, ROP-, LAC- 

33. pPL Lambda, 

Gommercially available plasmid from Pharmacia LKB (see above) contains pL promoter and coding region for N 
protein of lambda which can be used for expression of fusion or translationally coupled recombinant proteins. 

35 

34. pPL-alpha/beta 

Parental plasmids are 1 3 and 33 
contains DV-A and DV-B 
AR, ROP+ 

40 

35. pPL-dialphaTbeta 
Parental plasmid is 34 
contains RGV-DIALPHA and DV-B 
AR. ROP+ 

45 

36. pSGE0.1-L0 
Parental plasmid is 35 

ROP gene inactivated by insertion of Not I linker into Pvu II site in ROPgene 
AR, ROP- 

50 

37. pSK+ 

Gommerciaily available from Stratagene, LaJolla, Ga. 

38. PGS2488 

55 Derived from (37) by insertion of synthetic GAP 491 transcriptional initiation site. 

39. pGS2888 

Derived from (38) by conversion of Kpnl site to SphI site. 
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40. PGS4788 

Derived from (39) by insertion of synthetic GALuas '"to (39) to form GALGAP hybrid promoter. 

41. pLC IIFX-p-globin 

Plasmid available from Kiyoshi Nagai, Medical Research Council, London. England: bears p globin gene 

42. PUC19 

Plasmid commercially available from Bethesda Research Laboratories, Gaithersburg, Maryland 

43. PSUC2-6L 

Plasmid described by Stetler, et al., Biotechnology, 7:55-60 (1989) 

44. pUCISp-globin 
Derived from (41) and (42) 

45. Plasmid pGSllBS 

p-globin gene (from 44) is under control of the Sigma promoter and the MFX terminator (both from 43). 

46. Plasmid pGS3588 

Derived from (45) and (40); sigma promoter replaced by GALGAP promoter of (40). 

47. Plasmid pGS3888 

Denved from (46) by conversion of Smal site to Xhol site. 
47a. Plasmid pa-MRC 

Plasmid available from K. Nagai. MRC; bears alpha-globin gene. 

48 Plasmid pGS4088 

Derived from (47) by insertion of a-globin gene, replacing the p-globin gene. 

49 Plasmid pSN(+) 

Denved from (37) by changing Kpml site to Not! site. 
50. Plasmid pGS4888 

Derived from (49) by insertion of pGGAP-a-globin expression cassette from (48). 
61 Plasmid pGS 189 

Derived from (50) by insertion of pGGAP-p-globin expression cassette from (47). Plasmid thus bears both GALGAP 
a globin and GALGAP p globin operons. 

52 Plasmid pClU 

Plasmid described in Stetler, et al.. Biotechnology 7:55-60 (1989) 

53 Plasmid pC1N 

Derived from (52) by addition of Not! site. 

54 Plasmid pGS289 

Derived from (53) by insertion of a-globin and p-globin expression cassettes. 

55 Plasmid pGS389 

Ditto, but with Insert orientation reversed. 

56. Plasmid pYRp7 

Plasmid described in Strathern, et al., The Molecular Biology of the Yeast Saccharomvces (Cold Spring Harbon 
1981) 

Source of TRP1 gene. 
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57. pClT 

Derived from (52) and (56), 

Replacing Ura3 gene (52) with Trpi gene (56). 

58. pGS4988 

Derived from (57) by insertion of p-globin expression cassette from (47). 

59. Plasmid pGS4488 

Derived from (52) by insertion of a-globin expression cassette of (48). 

60. Plasmid pGS4688 

Ditto, but with insert orientation reversed. 

61 . Plasmid pGS4888 

Derived from (49) by insertion of GGAP -alpha globin expression cassette from (48). 

62. Plasmid pGS1889 

Derived from (61) by removal of PstI site. 

63. Plasmtd pGS1989 

Derived from (62) by removal of Spel site. 

64. Plasmid pGS21 89 

Derived from (61) and (63). Encodes di-alpha qlobin with RGGV linker. 

65. Plasmid pGS2989 

Derived from (64) and (47), contains di-alpha globin gene and beta globin gene. 

66. Plasmid pGS3089 

Derived from (53) by insertion of expression cassette of (65). 
66a. Phagescript 

Phage commercially available from Stratagene. 

67. Phage Mpp-globin 

Derived from Phagescript by insertion of GALGAP promoter and p-qlobin qene from (65). 
67a. Plasmid pGS3089 RGV desp 

Derived from (66) by deletion of Xhol fragment containing p globin expression cassette. 

68. Plasmid pGS3889 

(66) with "Presbyterian" (pN108K) mutation. 

69. Plasid PGS5189 

(66) with "Agenogi" (pESOK) mutation. 

70 Plasmid pGS5689 

(66) with "Kansas" (pN102T) mutation. 

71. Plasmid pGS4989 
(66) with "PV67I" mutation. 

72. Plasmid pGS2989 RPV 
(64) with RPV di-alpha linker 

73. Plasmid pGS2989 RGV 
(64) wiht RGV di-alpha linker 
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74. Plasmid pGS3089 RGV 

(53) with Hgb expression cassette from (73) 

75. Plasmid pGS3089 RPV 

5 (53) with Hgb expression cassette from (72) 



TABLE 300 



10 



20 



TABLE OF BACTERIAL STRAINS 




Strain 


Availability 


Expression 


1 


BL21 


Brool^haven 




2 


JM109 


Commercial 


TAC 


3 


SCSI 


Commercial 


TAC 


4 


JM110 


ATCC 


TAC 


5 


LE392 


ATCC 


TAC 


6 


23722 


ATCC 


TAC 


7 


W3110 


ATCC 


TAC 


8 


AG-1 


Commercial 


TAC 


9 


DH1 


ATCC 


TAC 


10 


NM554 


Commercial 


TAC 


11 


N99cl+ 


Commercial 


PL 


12 


N4830-1 


Commercial 


PL 



Claims 

35 

1. In a method for the production of a hemoglobin-like protein wherein an alpha globin-like polypeptide and a beta 
globin-like polypeptide are each expressed in transformed, non-erythrocyte cells, the improvement comprising 
expressing the alpha globin and beta globin-like polypeptides in the same cell in such manner that the alpha and 
beta globin-like polypeptides are assembled and combined with heme so as to intracelluiarly produce a biologically 

40 functional hemoglobin-like protein. 

2. The method of claim 1 which comprises (a) providing a yeast cell bearing (i) a gene encoding a human alpha globin- 
like polypeptide, said gene being linked to a promoter functional in said cell, and (ii) a gene encoding a human beta 
globin-like polypeptide, said gene being operably linked to a promoter functional In said cell, and (b) co-expressing 

45 said alpha globin and beta globin-like genes to obtain alpha globin and beta globin-like polypeptides, said cell folding 
together such polypeptides and incorporating heme to produce a human hemoglobin-like protein in soluble, recov- 
erable form. 

3. The method of claim 2 wherein at least one of said genes is expressed under the control of a GALGAP hybrid 
50 promoter. 

4. The method of claim 2 wherein both genes are carried by the same plasmid. 

5. The method of claim 2 wherein said gene is carried by a different plasmid. 

55 

6. The method of claim 2 wherein the yeast cell is of the species Saccharomyces cerevisiae . 

7. The method of claim 2 wherein the yeast cell is of a protease^leficient strain. 
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8. The method of claim 2 wherein the hemoglobin-like protein is a non-naturally occurring, mutant hemoglobin. 

9. The method of claim 1 which comprises (a) providing a transformed non-human cell bearing a polycistronic genetic 
unit comprising a first cistron coding for an alpha gtobln-like subunit protein and a second cistron coding for a beta 
globin-like subunit protein which may precede or follow said first cistron, and a single promoter controlling transcrip- 
tion of said polycistronic genetic unit, said cell being capable of synthesizing heme and Incorporating the heme into 
said subunit proteins so as to form a human hemoglobin-like protein, and (b) cultivating the cell under conditions 
suitable for expression of said human hemoglobin -like protein. 

10. The method of claim 9 in which the cell is a bacterial cell. 

11. The method of claim 10 in which the bacterial cell is Escherichia coll . 

12. The method of claim 1 1 in which the promoter Is the Tac promoter. 

13. The method of claim 9 wherein said first and second cistrons are preceded, respectively by first and second intro- 
ductory cistrons, wherein the second cistron may precede or follow the first cistron, and the first and secondary 
introductory cistrons each contain a Shine-Delgarno sequence operably positioned to facilitate translation of the 

following cistron. 

14. The method of claim 10 wherein a substantial portion of the functional human hemoglobin-like protein is found in 
the soluble fraction of the celt. 

1 5. The method of claim 13 wherein the human hemoglobin-like protein has a lower oxygen affinity than native human 

hemoglobin. 
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pDL ll-62rn 




FIG 2acont 
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FIG. 2b. 
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/Mlu I 

AAC GCG TTG TCT GCT CTG TCT GAT CTG CAC GCT CAC AAA CTG CGT GTT 



T7G 


CGC 


AAC 




CGA 


GAC 


AGA 




GAC 


GTG 


ACG 


GAC 


GAC 


CAA 


TGA 


GAC 


asp 


ala 


leu 


ser 


ala 


leu 


ser 


asp 


leu 


his 


ala 


his 


lys 


leu 


arg 


val 




/Hpal 




























GAT 


CCG 


GTT 


AAC 


TTC 


AAA 


CTG 


CTG 


TCT 


CAC 


TGC 


CTG 


CTG 


GTT 


ACT 


CTG 


CTA 


GGC 


CAA 


TTG 


AAG 


TTT 


GAC 


GAC 


AGA 


GTG 


ACG 


GAC 


GAC 


CAA 


TGA 


GAC 


asp 


pro 


val 


asn 


phe 


lys 


1 All 

leu 


leu 


ser 


nxs 


cys 


leu 


leu 


val 




leu 










/Kael 






















GCT 


GCT 


CAT 


CTG 


CCG 


GCT 


GAA 


TTT 


ACC 


CCG 


GCT 


GTT 


CAT 


GCG 


1 V. 1 




CGA 


CGA 


GTA 


GAC 


GGC 


CGA 


CTT 


AAA 


TGG 


GGC 


CGA 


CAA 


GTA 


CGC 


AGA 


GAC 


ala 


ala 


his 


leu 


pro 


ala 


glu 


phe 


thr 


pro 


ala 


val 


his 


ala 


ser 


leu 


GAT 


AAA 


TTC 


CTG 


GCT 


TCT 


GTT 


TCT 


ACC 


GTT 


CTG 


ACT 


TCG 


AAA 


TAC 


CGT 


CTA 


TTT 


AAG 


CAC 


CGA 


AGA 


CAA 


AGA 


TGG 


CAA 


GAC 


TGA ACG 


TTT 


ATG 


CGA 


asp 


lys 


phe 


leu 


ala 


ser 


val 


ser 


thr 


val 


leu 


thr 


ser 


lys 


tyr 


arg 










/PstI 




















TAA TGA 


C 


TGC 


AGO 


TAC 


ATG 


GAG 


ATT 


AAC 


TCA 


ATC 


TAG 


AGG 


GTA 


TTA 


ATT 


ACT 


G 


ACG 


TCG 


ATG 


TAC 


CTC 


TAA 


TTG 


AGT 


TAG 


ATC 


ACC 


CAT 


AAT 














SD*2 






/Ndet 










ATA 


ATG 


TAT 


CGC 


TTA 


AAT 


AAG 


GAC 


GAA 


TAA 


CAT 


ATG 


ATC 


GAA 


GGT 


CGT 


TAT 


TAG 


CGC 


GCG 


AAT 


TTA 


TTC 


CTC 


CTT 


ATT 


GTA 


TAC 


TAG 


CTT 


CCA 


GCA 



inet(ile glu gly arg) 
FactorX sequence 



GTT 


CAC 


CTG 


ACT 


CCG 


GAA 


GAA 


AAA 


TCC 


GCG 


GTT 


ACT 


GCT 


CTG 


TGG 


GGT 


CAA 


GTG 


GAC 


TGA 


GGC 


CTT 


CTT 


AAA 


AAG 


CGC 


tIAA 


TGA 


CGA 


GAC 


ACC 


CCA 


val 


his 


leu 


thr 


pro 


glu glu 


lys 


ser 


ala 


val 


thr 


ala 


leu trp gly 


AAA 


GTG 


AAC 


GTT 


GAC 


GAA 


GTT 


GGT 


GGT 


GAA 


CGT 


CTG 


GGA 


CGT 


CTG 


CTG 


TTT 


CAC 


TTG 


CAA 


CTG 


CTT 


CAA 


CCA 


CCA 


CTT 


CGA 


GAC 


CCT 


GCA 


GAC 


GAC 


lys 


val 


asn 


val 


asp 


glu 


val 


gly 


gly 


glu 


ala 


leu 


gly 


arg 


leu 


leu 




























/Bgl II 


GTT 


GTT 


TAC 


CCG 


TCG 


ACT 


CAG 


CGT 


TTC 


TTT 


GAA 


TCT 


TTC 


CGA 


GAT 


CTG 


CAA 


CAA 


ATG 


GGC 


ACC 


TGA 


GTC 


GCA 


AAG 


AAA 


CTT 


AGA 


AAG 


CCT 


CTA 


GAC 


val 


val 


tyr 


pro 


trp 


thr 


gin 


arg 


phe 


phe 


glu 


ser 


phe 


gly 


asp 


leu 


TCT 


ACC 


CCG 


GAC 


GCT 


GTT 


ATG 


GGT 


AAC 


CCG 


AAA 


GTT 


AAA 


GCC 


CAT 


GGT 


AGA 


TGG 


TTC 


CTG 


CGA 


CAA 


TAC 


CCA 


TTG 


GGC 


TTT 


CAA 


TTT 


CGG 


GTA 


CCA 


ser 


thr 


pro 


asp 


ala 


val 


met 


gly 


asn 


pro 


lys 


val 


lys 


ala 


his 


gly 


AAA 


AAA 


GTT 


CTG 


GGT 


GCT 


TTC 


TCT 


GAC 


GGT 


CTG 


GCT 


CAC 


CTG 


GAC 


AAC 


TTT 


TTT 


CAA 


GAC 


CCA 


CGA 


AAG 


AGA 


CTG 


CCA 


GAC 


CGA 


GTG 


GAC 


CTG 


TTG 


lys 


lys 


val 


leu 


gly 


ala 


phe 


ser 


asp 


gly 


leu 


ala 


his 


leu 


asp 


asn 



f/S- 5C0NT. 
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/Kpnl 



/Sad 



CTG AAA CGT ACC TTC GCT ACT CTG TCT GAG CTC CAC TGC GAC AAA CTG 
GAC TTT CCA TGG AAG CGA TGA GAC AGA CTC GAG GTG ACG CTG TTT GAC 
leu lys gly thr phe ala thr leu ser glu leu his cys asp lys leu 



CAC GTT GAC CCG GAA AAC TTC CGT CTG CTG GGT AAC GTA CTA GTT TGC 
GTG CAA CTG GGC CTT TTG AAG GCA GAC GAC CCA TTG CAT GAT CAA ACG 
his val asp pro glu asn phe arg leu leu gly asn val leu val cys 



GTT CTG GCT CAC CAC TTC GGT AAA GAA TTC ACT CCG CCG GTT CAG GCT 
CAA GAC CGA GTG GTG AAG CCA TTT CTT AAG TGA GGC GGC CAA GTC CGA 
val leu ala his his phe gly lys glu phe thr pro pro val gin ala 

GCT TAC CAG AAA GTT GTT GCT GGT GTT GCT AAC GCG CTA GCT CAC AAA 
CGA ATG GTC TTT CAA CAA CGA CCA CAA CGA TTG CGC GAT CGA GTG TTT 
ala tyr gin lys val val ala gly val ala asn ala leu ala his lys 



/Spel 



/EcoRI 



TAC CAC 
ATG GTG 
tyr his 



/Hindlll 

TAA TGA 

ATT ACT TCG A 
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FI6 7 



Hemoglobin Beth Israel (beta asn— >ser) : 

/Sad 

C CAC TGC GAC AAA CTG 
TC GAG GTG ACG CTG TTT GAC 
leu his cys asp lys leu 



/Spel 

CAC GTT GAC CCG GAA tct TTC CGT CTG CTG GGT AAC GTA 

GTG CAA CTG GGC CTT aga AAG GCA GAC GAC CCA TTG CAT GAT C 

his val asp pro glu SER phe arg leu leu gly asn val leu 



Hemoglobin Cheverly (beta phe — >ser) : 

/Serll 

CG GTT ACT GCT CTG TGG GGT 
G CGC CAA TGA CGA GAC ACC CCA 
ala val thr ala leu trp gly 

AAA GTC AAC GTT GAC GAA GTT GGT GGT GAA GCT CTG GGA CGT CTG CTG 
TTT CAC TTG CAA CTG CTT CAA CCA CCA CTT CGA GAC CCT GCA GAC GAC 
lys val asn val asp glu val gly gly glu ala leu gly arg leu leu 

/Bglll 

GTT GTT TAC CCG TGG ACT CAG CGT TTC TTT, GAA TCT tct GCA 

CAA CAA ATG GGC ACC TGA GTC GCA AAG AAA CTT AGA aga CCT CTA G 

val val tyr pro trp thr gin arg phe phe glu ser SER gly asp 

Hemoglobin beta val — >ile: 

/Ncol 
CAT GCT 
CA 
gly 



AAA AAA ate CTG GGT GCT TTC TCT 
TTT TTT tag GAC CGA CCA AAG AGA 
lys lys ile leu gly ala phe ser 

/Kpnl 

CTG AAA GGT AC 
GAC TTT C 
leu lys gly 



GAC GGT CTG GCT CAC CTG GAC AAC 
CTG CCA GAC CGA GTG GAC CTG TTG 
asp gly leu ala his leu asp asn 



Hemoglobin Kansas (beta asn — >thr) 

/Sad 

C CAC TGC GAC AAA CTG 
TC GAG GTG ACG CTG TTT GAC 
glu leu his cys asp lys leu 

/Spel 

CAC GTT GAC CCG GAA acc TTC CGT CTG CTG GGT AAC GTA 

GTG CAA CTG GGC CTT tgg AAG GCA GAC GAC CCA TTG CAT GAT C 

his val asp pro glu asn phe arg leu leu gly asn val leu val 
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